1 Inference of Regulatory Networks via Systems Genetics Ina Hoeschele.

Slides:



Advertisements
Similar presentations
Fossils provide a record of evolution.
Advertisements

Lecture 39 Prof Duncan Shaw. Meiosis and Recombination Chromosomes pair upDNA replication Chiasmata form Recombination 1st cell division 2nd cell divisionGametes.
Introduction to Evolution
Biology 102 Biotechnology.
Lecture 3: Jan. 25 Transmission genetics: independent assortment Human pedigrees.
Who figured this Genetics Stuff Out? Biology Standards 3a and b Genetic s.
Classical and Modern Genetics.  “Genetics”: study of how biological information is carried from one generation to the next –Classical Laws of inheritance.
Traits & Environment Pp What are traits? Hair color Eye color HeightWeight Male vs. Female.
Bioinformatics: A New Frontier for Computer Scientists Ruth G. Alscher Lenwood S. Heath.
Design Elements: Primitives. Try to Make these Primitives.
Challenges to the Modern Synthesis Stephen Jay Gould, Nils Eldredge,
Lecture 21: Molecular Tools of Genetic Diagnosis Reading Assignment: Chapter 42, pgs ; Harper’s Biochemistry (25 th edition). Objective: To understand.
Populations & Gene Pools and Genetic Variation.
Molecular Biology of Genes Chapters DNA Technology (not in your book)
Development of new tools to study the cell biology and origins of phenotypic variation in the human pathogen Streptococcus pneumoniae: An overview of recently.
Protein Synthesis Mutations and Genetic Disorders.
Ch5 Sec3 Advances in Genetics. Key Concepts What are three ways of producing organisms with desired traits? What is the goal of the Human Genome Project?
Single Nucleotide Polymorphisms Mrs. Stewart Medical Interventions Central Magnet School.
DNA Structure and Function. DNA Genetic instructions –Passed on Genes encode proteins –Gene = information –Protein = action hero (does the work)
Allele. Alternate form of a gene gene variant autosome.
Plant Breeing and Genetic Engineering Plant Science.
19.1 Techniques of Molecular Genetics Have Revolutionized Biology
Announcements: Proposal resubmission deadline 4/23 (Thursday).
AP Biology DNA Study Guide. Chapter 16 Molecular Basis of Heredity The structure of DNA The major steps to replication The difference between replication,
Connecting Meiosis to Genetics Biology. How are meiosis & genetics related? 1. Meiosis produces gamete cells.
IPG2P Working Group Update. iPG2P Final deliverable: – Procedure allowing an investigator to begin with trait of interest in species possessing limited.
Characteristics of Living Things Big Ideas in Biology.
Mosby items and derived items © 2007, 2005, 2002 by Mosby, Inc., an affiliate of Elsevier Inc. CHAPTER 50 Gene Therapy and Pharmacogenomics.
AP Biology Chapter 15 – Mechanisms of Evolution
Using a Single Nucleotide Polymorphism to Predict Bitter Tasting Ability Lab Overview.
EB3233 Bioinformatics Introduction to Bioinformatics.
Have your clickers ready!. Countdown The theory of evolution. 2. Discovery of DNA. 3. The first Laws of Heredity. 4. Description of the human genome.
Dominant and Recessive Dominance Table 3. Alleles sequence of DNA any of several forms of a gene determine the genotype (genetic constitution of an organism.
Gene Mapping ROBERT SANTOS ENGLISH 100 ESP NOVEMBER
Overview of Bioinformatics Module Denis Manley.. Contact Details Lecturer Name: Denis Manley Room number: KE-1-013a
Genetics Review Honors Human Anatomy & Physiology Mr. Mazza
BIOLOGY 12 Introduction to Genetics. “If our strands of DNA were stretched out in a line, the 46 chromosomes making up the human genome would extend more.
DNA RNA Cell Membrane - Protection, transporters, sensors Nucleus - Information (the genes) THE CELL.
Ch 13-1, 13-4 & 14-1: Changing the Living World, Genetic Engineering, Human Molecular Genetics Essential Questions: What is the purpose of selective breeding?
Modelling from Sequence to Gene Regulatory Network to Phenotype.
The Future of Genetics Research Lesson 7. Human Genome Project 13 year project to sequence human genome and other species (fruit fly, mice yeast, nematodes,
Using a Single Nucleotide Polymorphism to Predict Bitter Tasting Ability Lab Overview.
CST Prep: A. DNA, RNA, & Proteins Protein Differences.
Figure Molecular Biology of the Cell (© Garland Science 2008)
Figure 14-1 Molecular Biology of the Cell (© Garland Science 2008)
Statistical Applications in Biology and Genetics
Molecular genetic in Animal Production
Introduction to Biology
BIO 3344 Innovative Education--snaptutorial.com
Evolution Exams returned W 2/13 Bonus #1 due F 2/15.
KEY CONCEPT Technology continually changes the way biologists work.
Relationship between Genotype and Phenotype
Genetics: Inheritance
AIM: How are DNA molecules structured
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
B-4.2 Summarize the relationship among DNA, genes, and chromosomes.
GENE POOL All the genes of all members of a particular population.
Relationship between Genotype and Phenotype
In these studies, expression levels are viewed as quantitative traits, and gene expression phenotypes are mapped to particular genomic loci by combining.
INTRODUCTION TO MOLECULAR GENETICS
Working in the Post-Genomic C. elegans World
-The relationship between genes and traits. -Fields of Genetics.
Genetics vocab 1.
INTRODUCTION TO MOLECULAR GENETICS
Unit: Animals at the Cellular Level
Cell Division To be able to understand mitosis and meiosis
CHAPTER 11 GENETICS Genetics is the study of how traits or characteristics are inherited. Inherited characteristics are controlled by genes and are passed.
Natural Selection in Action
Presentation transcript:

1 Inference of Regulatory Networks via Systems Genetics Ina Hoeschele

2 Systems BiologyComplex Trait Biology Systems Genetics Infer cells regulatory structure Infer molecular basis of phenotypes / diseases

3 Systems Genetics Measure DNA sequence polymorphisms on a group of related individuals (<100 to 2000+) covering the entire genome (e.g. SNPs) Measure DNA sequence polymorphisms on a group of related individuals (<100 to 2000+) covering the entire genome (e.g. SNPs) Several genotypes at each polymorphism (e.g. two, 0/1) Several genotypes at each polymorphism (e.g. two, 0/1) Multi-factorial perturbations of a system, genetically randomized populations Multi-factorial perturbations of a system, genetically randomized populations Measure molecular and organismal variables, e.g. Measure molecular and organismal variables, e.g. Expression profiling (etraits) Expression profiling (etraits) Expression profiling and disease phenotypes Expression profiling and disease phenotypes Expression profiling, methylation profiling, disease Expression profiling, methylation profiling, disease Metabolite, protein profiling … Metabolite, protein profiling …

4 Systems Genetics The genotypes at some polymorphisms influence directly the expression of certain genes The genotypes at some polymorphisms influence directly the expression of certain genes in cis: polymorphism A in gene As promoter region influences its transcript abundance in cis: polymorphism A in gene As promoter region influences its transcript abundance in trans: polymorphism A in gene As coding region influences the function of protein A; let gene A be a regulator of gene B, then both polymorphism A and gene A influence the expression of gene B in trans: polymorphism A in gene As coding region influences the function of protein A; let gene A be a regulator of gene B, then both polymorphism A and gene A influence the expression of gene B

5 Systems Genetics The genes expression profiles (=etraits) have both polymorphism and gene (etrait) regulators The genes expression profiles (=etraits) have both polymorphism and gene (etrait) regulators Very large number of targets (regulated genes etc.) Very large number of targets (regulated genes etc.) Very large number of potential regulators for each target Very large number of potential regulators for each target Sample size (n) MUCH smaller than number of potential regulators (p) Sample size (n) MUCH smaller than number of potential regulators (p) Targets are co-regulated Targets are co-regulated Regulators are correlated Regulators are correlated Regulatory networks are cyclic Regulatory networks are cyclic Analyses of regulatory programs should account for all of the above Analyses of regulatory programs should account for all of the above

6 Systems Genetics One target – one regulator approach One target – one regulator approach Y T = + bP R + e Y T = + bP R + e do for each T and each R (except cis analysis) do for each T and each R (except cis analysis) low power low power trans: Y T = + b 1 Y R + b 2 P R + e (+ cisP) trans: Y T = + b 1 Y R + b 2 P R + e (+ cisP) better power but does not account for co- regulation of multiple targets better power but does not account for co- regulation of multiple targets

7 Systems Genetics One target – all regulators approach One target – all regulators approach T = + R b 1R P R (+ R b 2R Y R ) + e Y T = + R b 1R P R (+ R b 2R Y R ) + e do for each T, still does not account for co-regulation do for each T, still does not account for co-regulation standard variable selection methods and regularization methods tend not to perform well (n<<p, correlated regulators) standard variable selection methods and regularization methods tend not to perform well (n<<p, correlated regulators) May also need to consider interactions among loci May also need to consider interactions among loci Often ignored or limited to two-way interactions Often ignored or limited to two-way interactions Penalization/Regularization methods Penalization/Regularization methods Constrained OLS, bounds on L t norm(s) of coefficients (t=1, 2, …) Constrained OLS, bounds on L t norm(s) of coefficients (t=1, 2, …) Elastic net variable selection (Zou and Hastie 2005) Elastic net variable selection (Zou and Hastie 2005) Extension of lasso (compromise with ridge regression) Extension of lasso (compromise with ridge regression) n<<p, joint selection of correlated predictors n<<p, joint selection of correlated predictors Bayesian variable selection Bayesian variable selection Priors on b Priors on b MCMC ?? MCMC ?? Deterministic (e.g. variational) ?? Deterministic (e.g. variational) ??

8 Systems Genetics Clustering of targets Clustering of targets Analyze jointly the targets in a cluster Analyze jointly the targets in a cluster Single regulator model, multivariate analysis costly Single regulator model, multivariate analysis costly PCA within clusters, analyze PCs separately PCA within clusters, analyze PCs separately Analyze cluster with all regulator model (individual Y model but joint variable selection) Analyze cluster with all regulator model (individual Y model but joint variable selection) Geronemo: iteratively perform clustering and selection of cluster=module regulators (regression tree) (Lee et al. 2006) Geronemo: iteratively perform clustering and selection of cluster=module regulators (regression tree) (Lee et al. 2006)

9 Systems Genetics Biclustering, two-group association Biclustering, two-group association Find groups of targets regulated by groups of polymorphisms Find groups of targets regulated by groups of polymorphisms Biclustering based on matrix of associations btw targets and polymorphisms – efficient but meaningful results? Biclustering based on matrix of associations btw targets and polymorphisms – efficient but meaningful results? Various approaches for two-group association Various approaches for two-group association Penalized Canonical Correlation Analysis (CCA) Penalized Canonical Correlation Analysis (CCA) Represent CCA in regression framework Represent CCA in regression framework Bayesian CCA (probabilistic interpretation, joint latent factor model for both groups of variables) Bayesian CCA (probabilistic interpretation, joint latent factor model for both groups of variables) MCMC (convergence issues, see factor analysis) MCMC (convergence issues, see factor analysis) deterministic (variational) deterministic (variational)

10 Systems Genetics Two-step regulatory network inference Two-step regulatory network inference 1a) Construct an Undirected Dependency Graph (UDG) using target data (e.g., expression) only 1b) Determine which polymorphisms affects which targets and use this information to direct edges (e.g., Neto et al. 2008) 2a) Perform cis and trans polymorphism analysis and combine into an encompassing network (Liu et al. 2008) 2b) Sparsify the network, using structural equation modeling SEM extension of linear regression (variables can be both response and predictor) extension of linear regression (variables can be both response and predictor) likelihoods for SEM and LR not the same for cyclic networks likelihoods for SEM and LR not the same for cyclic networks Toward one-step regulatory network inference Toward one-step regulatory network inference Geronemo (etraits; small list (~300) of candidate regulator genes) Geronemo (etraits; small list (~300) of candidate regulator genes)

11