Download presentation
Presentation is loading. Please wait.
Published byGrace Andrews Modified over 11 years ago
1
1 Inference of Regulatory Networks via Systems Genetics Ina Hoeschele
2
2 Systems BiologyComplex Trait Biology Systems Genetics Infer cells regulatory structure Infer molecular basis of phenotypes / diseases
3
3 Systems Genetics Measure DNA sequence polymorphisms on a group of related individuals (<100 to 2000+) covering the entire genome (e.g. SNPs) Measure DNA sequence polymorphisms on a group of related individuals (<100 to 2000+) covering the entire genome (e.g. SNPs) Several genotypes at each polymorphism (e.g. two, 0/1) Several genotypes at each polymorphism (e.g. two, 0/1) Multi-factorial perturbations of a system, genetically randomized populations Multi-factorial perturbations of a system, genetically randomized populations Measure molecular and organismal variables, e.g. Measure molecular and organismal variables, e.g. Expression profiling (etraits) Expression profiling (etraits) Expression profiling and disease phenotypes Expression profiling and disease phenotypes Expression profiling, methylation profiling, disease Expression profiling, methylation profiling, disease Metabolite, protein profiling … Metabolite, protein profiling …
4
4 Systems Genetics The genotypes at some polymorphisms influence directly the expression of certain genes The genotypes at some polymorphisms influence directly the expression of certain genes in cis: polymorphism A in gene As promoter region influences its transcript abundance in cis: polymorphism A in gene As promoter region influences its transcript abundance in trans: polymorphism A in gene As coding region influences the function of protein A; let gene A be a regulator of gene B, then both polymorphism A and gene A influence the expression of gene B in trans: polymorphism A in gene As coding region influences the function of protein A; let gene A be a regulator of gene B, then both polymorphism A and gene A influence the expression of gene B
5
5 Systems Genetics The genes expression profiles (=etraits) have both polymorphism and gene (etrait) regulators The genes expression profiles (=etraits) have both polymorphism and gene (etrait) regulators Very large number of targets (regulated genes etc.) Very large number of targets (regulated genes etc.) Very large number of potential regulators for each target Very large number of potential regulators for each target Sample size (n) MUCH smaller than number of potential regulators (p) Sample size (n) MUCH smaller than number of potential regulators (p) Targets are co-regulated Targets are co-regulated Regulators are correlated Regulators are correlated Regulatory networks are cyclic Regulatory networks are cyclic Analyses of regulatory programs should account for all of the above Analyses of regulatory programs should account for all of the above
6
6 Systems Genetics One target – one regulator approach One target – one regulator approach Y T = + bP R + e Y T = + bP R + e do for each T and each R (except cis analysis) do for each T and each R (except cis analysis) low power low power trans: Y T = + b 1 Y R + b 2 P R + e (+ cisP) trans: Y T = + b 1 Y R + b 2 P R + e (+ cisP) better power but does not account for co- regulation of multiple targets better power but does not account for co- regulation of multiple targets
7
7 Systems Genetics One target – all regulators approach One target – all regulators approach T = + R b 1R P R (+ R b 2R Y R ) + e Y T = + R b 1R P R (+ R b 2R Y R ) + e do for each T, still does not account for co-regulation do for each T, still does not account for co-regulation standard variable selection methods and regularization methods tend not to perform well (n<<p, correlated regulators) standard variable selection methods and regularization methods tend not to perform well (n<<p, correlated regulators) May also need to consider interactions among loci May also need to consider interactions among loci Often ignored or limited to two-way interactions Often ignored or limited to two-way interactions Penalization/Regularization methods Penalization/Regularization methods Constrained OLS, bounds on L t norm(s) of coefficients (t=1, 2, …) Constrained OLS, bounds on L t norm(s) of coefficients (t=1, 2, …) Elastic net variable selection (Zou and Hastie 2005) Elastic net variable selection (Zou and Hastie 2005) Extension of lasso (compromise with ridge regression) Extension of lasso (compromise with ridge regression) n<<p, joint selection of correlated predictors n<<p, joint selection of correlated predictors Bayesian variable selection Bayesian variable selection Priors on b Priors on b MCMC ?? MCMC ?? Deterministic (e.g. variational) ?? Deterministic (e.g. variational) ??
8
8 Systems Genetics Clustering of targets Clustering of targets Analyze jointly the targets in a cluster Analyze jointly the targets in a cluster Single regulator model, multivariate analysis costly Single regulator model, multivariate analysis costly PCA within clusters, analyze PCs separately PCA within clusters, analyze PCs separately Analyze cluster with all regulator model (individual Y model but joint variable selection) Analyze cluster with all regulator model (individual Y model but joint variable selection) Geronemo: iteratively perform clustering and selection of cluster=module regulators (regression tree) (Lee et al. 2006) Geronemo: iteratively perform clustering and selection of cluster=module regulators (regression tree) (Lee et al. 2006)
9
9 Systems Genetics Biclustering, two-group association Biclustering, two-group association Find groups of targets regulated by groups of polymorphisms Find groups of targets regulated by groups of polymorphisms Biclustering based on matrix of associations btw targets and polymorphisms – efficient but meaningful results? Biclustering based on matrix of associations btw targets and polymorphisms – efficient but meaningful results? Various approaches for two-group association Various approaches for two-group association Penalized Canonical Correlation Analysis (CCA) Penalized Canonical Correlation Analysis (CCA) Represent CCA in regression framework Represent CCA in regression framework Bayesian CCA (probabilistic interpretation, joint latent factor model for both groups of variables) Bayesian CCA (probabilistic interpretation, joint latent factor model for both groups of variables) MCMC (convergence issues, see factor analysis) MCMC (convergence issues, see factor analysis) deterministic (variational) deterministic (variational)
10
10 Systems Genetics Two-step regulatory network inference Two-step regulatory network inference 1a) Construct an Undirected Dependency Graph (UDG) using target data (e.g., expression) only 1b) Determine which polymorphisms affects which targets and use this information to direct edges (e.g., Neto et al. 2008) 2a) Perform cis and trans polymorphism analysis and combine into an encompassing network (Liu et al. 2008) 2b) Sparsify the network, using structural equation modeling SEM extension of linear regression (variables can be both response and predictor) extension of linear regression (variables can be both response and predictor) likelihoods for SEM and LR not the same for cyclic networks likelihoods for SEM and LR not the same for cyclic networks Toward one-step regulatory network inference Toward one-step regulatory network inference Geronemo (etraits; small list (~300) of candidate regulator genes) Geronemo (etraits; small list (~300) of candidate regulator genes)
11
11
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.