Stitching the Tutorials Together Introduction to Systems Biology Course Chris Plaisier Institute for Systems Biology.

Slides:



Advertisements
Similar presentations
IDENTIFYING CAUSAL GENES AND DYSREGULATED PATHWAYS IN COMPLEX DISEASES Nov. 6 th, 2010 YOO-AH KIM NIH / NLM / NCBI.
Advertisements

SI0030 Social Research Methods Week 6 Luke Sloan
Gene Set Enrichment Analysis Genome 559: Introduction to Statistical and Computational Genomics Elhanan Borenstein.
Automated Regression Modeling Descriptive vs. Predictive Regression Models Four common automated modeling procedures Forward Modeling Backward Modeling.
Differential Expression Analysis Introduction to Systems Biology Course Chris Plaisier Institute for Systems Biology.
Inferring Biologically Meaningful Relationships Introduction to Systems Biology Course Chris Plaisier Institute for Systems Biology.
Introduction to genomes & genome browsers
Gene Set Enrichment Analysis Genome 559: Introduction to Statistical and Computational Genomics Elhanan Borenstein.
Putting genetic interactions in context through a global modular decomposition Jamal.
What is Interaction for A Binary Outcome? Chun Li Department of Biostatistics Center for Human Genetics Research September 19, 2007.
Soc 3306a: Path Analysis Using Multiple Regression and Path Analysis to Model Causality.
DEPARTMENT OF STATISTICS Stats Questions We Are Often Asked.
Peter Tsai, Bioinformatics Institute.  University of California, Santa Cruz (UCSC)  A rapid and reliable display of any requested portion of genomes.
Population Approaches to Detecting and Genotyping Copy Number Variation Lachlan Coin July 2010.
Yanxin Shi 1, Fan Guo 1, Wei Wu 2, Eric P. Xing 1 GIMscan: A New Statistical Method for Analyzing Whole-Genome Array CGH Data RECOMB 2007 Presentation.
Teresa Przytycka NIH / NLM / NCBI RECOMB 2010 Bridging the genotype and phenotype.
Applied statistics Katrin Jaedicke
STATISTICS TUTORIAL Applied Research In Organizational Behavior By: Dr. Goli Sadri.
Introduction Integrative Analysis of Genomic Variants in Carcinogenesis Syed Haider, Arek Kasprzyk, Pietro Lio Artificial Intelligence and Computational.
MSc GBE Course: Genes: from sequence to function Genome-wide Association Studies Sven Bergmann Department of Medical Genetics University of Lausanne Rue.
Statistics Micro Mini Threats to Your Experiment!
Analysis of Differential Expression T-test ANOVA Non-parametric methods Correlation Regression.
Data Analysis Statistics. Inferential statistics.
Give me your DNA and I tell you where you come from - and maybe more! Lausanne, Genopode 21 April 2010 Sven Bergmann University of Lausanne & Swiss Institute.
Using Statistics in Research Psych 231: Research Methods in Psychology.
Introduction to Glioblastoma Chris Plaisier Introduction to Systems Biology Course Institute for Systems Biology.
T-tests and ANOVA Statistical analysis of group differences.
Soc 3306a Lecture 8: Multivariate 1 Using Multiple Regression and Path Analysis to Model Causality.
Multiple Choice Questions for discussion
Geuvadis RNAseq analysis at UNIGE Analysis plans
1 Example 1.0: Indirect Effects and the Test of Mediation.
Radiogenomics in glioblastoma multiforme
MANAGEMENT AND ANALYSIS OF WILDLIFE BIOLOGY DATA Bret A. Collier 1 and T. Wayne Schwertner 2 1 Institute of Renewable Natural Resources, Texas A&M University,
Expanded PLA2G6 Copy Number Variant Analysis in Patients with Infantile Neuroaxonal Dystrophy (INAD) Danielle Crompton, P. K. Rehal, L. MacPherson, K.
Amsterdam Rehabilitation Research Center | Reade Multiple regression analysis Analysis of confounding and effectmodification Martin van de Esch, PhD.
Inferential statistics. Why statistics are important Statistics are concerned with difference – how much does one feature of an environment differ.
Chapter 18 Some Other (Important) Statistical Procedures You Should Know About Part IV Significantly Different: Using Inferential Statistics.
Head and Neck Cancer: microRNA analysis
MicroRNA identification based on sequence and structure alignment Presented by - Neeta Jain Xiaowo Wang†, Jing Zhang†, Fei Li, Jin Gu, Tao He, Xuegong.
Inferring transcriptional and microRNA-mediated regulatory programs in glioblastma Setty, M., et al.
Epigenetic Analysis BIOS Statistics for Systems Biology Spring 2008.
Multiple Linear Regression. Purpose To analyze the relationship between a single dependent variable and several independent variables.
Regression. Population Covariance and Correlation.
Next Colin Clarke-Hill and Ismo Kuhanen 1 Analysing Quantitative Data 1 Forming the Hypothesis Inferential Methods - an overview Research Methods Analysing.
C M Clarke-Hill1 Analysing Quantitative Data Forming the Hypothesis Inferential Methods - an overview Research Methods.
I271B The t distribution and the independent sample t-test.
Experimental Research Methods in Language Learning Chapter 10 Inferential Statistics.
 A llele frequencies will remain constant unless one or more factors cause the frequencies to change.  If there is no change, there is no evolving.
Correlation and Linear Regression Peter T. Donnan Professor of Epidemiology and Biostatistics Statistics for Health Research.
Introduction to connectivity: Psychophysiological Interactions Roland Benoit MfD 2007/8.
MGT-491 QUANTITATIVE ANALYSIS AND RESEARCH FOR MANAGEMENT OSMAN BIN SAIF Session 22.
Introduction to Basic Statistical Tools for Research OCED 5443 Interpreting Research in OCED Dr. Ausburn OCED 5443 Interpreting Research in OCED Dr. Ausburn.
1 Virtual COMSATS Inferential Statistics Lecture-25 Ossam Chohan Assistant Professor CIIT Abbottabad.
© Buddy Freeman, 2015 Let X and Y be two normally distributed random variables satisfying the equality of variance assumption both ways. For clarity let.
Experimental design and power analysis for systems biology Karlyn Beer 21 Aug 2012.
GO enrichment and GOrilla
Practical Statistics Regression. There are six statistics that will answer 90% of all questions! 1. Descriptive 2. Chi-square 3. Z-tests 4. Comparison.
Linear Correlation (12.5) In the regression analysis that we have considered so far, we assume that x is a controlled independent variable and Y is an.
Chapter 5: Introductory Linear Regression
Title: Assign Pathways to Gene Set June 21, 2007 Guanming Wu.
Data Analysis: Statistics for Item Interactions. Purpose To provide a broad overview of statistical analyses appropriate for exploring interactions and.
Accuracy, Reliability, and Validity of Freesurfer Measurements David H. Salat
Simultaneous identification of causal genes and dys-regulated pathways in complex diseases Yoo-Ah Kim, Stefan Wuchty and Teresa M Przytycka Paper to be.
Marshall University School of Medicine Department of Biochemistry and Microbiology BMS 617 Lecture 16 : Summary Marshall University Genomics Core Facility.
Inferential Statistics Psych 231: Research Methods in Psychology.
by Andrea J. O'Hara, Ling Wang, Bruce J. Dezube, William J
Regression Analysis.
Label propagation algorithm
Business Statistics - QBM117
Global analysis of the chemical–genetic interaction map.
Presentation transcript:

Stitching the Tutorials Together Introduction to Systems Biology Course Chris Plaisier Institute for Systems Biology

Impetus for Tutorial

Searching for Amplified miRNAs Region on chromosome 12 containing hsa-miR- 26a is amplified in glioma This region also contains a few other genes including the genes encapsulating hsa-miR-26a

Screen for miRNAs Correlated with CNV Which miRNA’s expression is modified by CNVs? – hsa-miR-26a = Amplified – hsa-miR-491 = Deleted – hsa-miR-339 = Amplified and deleted in sub-clonal populations

Correlation of miRNA Expression and Copy Number Variation

Comparison of Approaches We identified other miRNA not reported by Kim, et al – Our screening approach yielded more putatively dysregulated miRNAs – Cherry picking can be useful Typically need to follow rigorous statistical protocols for initial screen After initial screen multiple paths to publication – Up to expertise of researcher

How could we make it better? User linear regression and add covariates: – Sex – Age –... Others?

Gene Affected by miRNA Perturbation Applied biologically motivated filtering of putative genes targeted by miRNA to those predicted by PITA – Complementarity of miRNA to 3’ UTR – … Tested for correlation between miRNA to putative target genes

Correlation of miRNA Expression with Putative Target Genes

Correlation with CNV We took it one step further and tested for a correlation between the CNV and miRNA target genes Only one gene for hsa- miR-26a that was associated with the miRNA expression was not associated with hsa- miR-26a CNV

How could we make it better? Use linear regression and integrate strength of PITA match Others?

Causality? Purpose of correlation with CNV is that we wanted to make sure the effect of the miRNA on the target genes was through the CNV Can we really say for sure the direction of the association? – Can we imply causality?

Consistent with Causality While not a test for causality the fact that conditioning miRNA target gene expression on hsa-miR-26a expression significantly reduces the association is consistent with a causal model There are ways we could model causal information flow – – 89.html 89.html

Integration with Clinical Phenotypes Case / Control – Student’s T-Test Correlation / Regression Survival analysis – Need time to event and status at end of study

Glioma miRNAs Case/Control

hsa-miR-26a Amplification Significantly Associated with Lower Survival

Summary We repeated analysis done by Kim, et al. You learned how to calculate a correlation and Student’s T-test in R You learned how to correct for multiple testing You learned how to go from a global study to specific hypotheses for testing