Dynamic modeling of gene expression data

Slides:



Advertisements
Similar presentations
On an Improved Chaos Shift Keying Communication Scheme Timothy J. Wren & Tai C. Yang.
Advertisements

Chapter 28 – Part II Matrix Operations. Gaussian elimination Gaussian elimination LU factorization LU factorization Gaussian elimination with partial.
Generalised Inverses Modal Analysis and Modal Testing S. Ziaei Rad.
Tensors and Component Analysis Musawir Ali. Tensor: Generalization of an n-dimensional array Vector: order-1 tensor Matrix: order-2 tensor Order-3 tensor.
Lecture 18 Varimax Factors and Empircal Orthogonal Functions.
Principal Component Analysis (PCA) for Clustering Gene Expression Data K. Y. Yeung and W. L. Ruzzo.
Lecture 19 Singular Value Decomposition
Principal Component Analysis CMPUT 466/551 Nilanjan Ray.
Active Calibration of Cameras: Theory and Implementation Anup Basu Sung Huh CPSC 643 Individual Presentation II March 4 th,
Lecture 17 Factor Analysis. Syllabus Lecture 01Describing Inverse Problems Lecture 02Probability and Measurement Error, Part 1 Lecture 03Probability and.
SocalBSI 2008: Clustering Microarray Datasets Sagar Damle, Ph.D. Candidate, Caltech  Distance Metrics: Measuring similarity using the Euclidean and Correlation.
Some useful linear algebra. Linearly independent vectors span(V): span of vector space V is all linear combinations of vectors v i, i.e.
Unsupervised Learning - PCA The neural approach->PCA; SVD; kernel PCA Hertz chapter 8 Presentation based on Touretzky + various additions.
A New Approach to Analyzing Gene Expression Time Series Data Ziv Bar-Joseph Georg Gerber David K. Gifford Tommi S. Jaakkola Itamar Simon Learning Seminar:
Singular Value Decomposition
Introduction to Information Retrieval Introduction to Information Retrieval Hinrich Schütze and Christina Lioma Lecture 18: Latent Semantic Indexing 1.
Microarray analysis Algorithms in Computational Biology Spring 2006 Written by Itai Sharon.
Previously Two view geometry: epipolar geometry Stereo vision: 3D reconstruction epipolar lines Baseline O O’ epipolar plane.
Lecture 11 Vector Spaces and Singular Value Decomposition.
Dimension of Meaning Author: Hinrich Schutze Presenter: Marian Olteanu.
Inferring the nature of the gene network connectivity Dynamic modeling of gene expression data Neal S. Holter, Amos Maritan, Marek Cieplak, Nina V. Fedoroff,
Transforms: Basis to Basis Normal Basis Hadamard Basis Basis functions Method to find coefficients (“Transform”) Inverse Transform.
Principal Component Analysis. Philosophy of PCA Introduced by Pearson (1901) and Hotelling (1933) to describe the variation in a set of multivariate data.
Principal Component Analysis (PCA) for Clustering Gene Expression Data K. Y. Yeung and W. L. Ruzzo.
Summarized by Soo-Jin Kim
Chapter 2 Dimensionality Reduction. Linear Methods
Linear Algebra Review 1 CS479/679 Pattern Recognition Dr. George Bebis.
6.837 Linear Algebra Review Patrick Nichols Thursday, September 18, 2003.
Patrick Nichols Thursday, September 18, Linear Algebra Review.
CS246 Topic-Based Models. Motivation  Q: For query “car”, will a document with the word “automobile” be returned as a result under the TF-IDF vector.
Next. A Big Thanks Again Prof. Jason Bohland Quantitative Neuroscience Laboratory Boston University.
CPSC 491 Xin Liu Nov 17, Introduction Xin Liu PhD student of Dr. Rokne Contact Slides downloadable at pages.cpsc.ucalgary.ca/~liuxin.
We will use Gauss-Jordan elimination to determine the solution set of this linear system.
Generic text summarization using relevance measure and latent semantic analysis Gong Yihong and Xin Liu SIGIR, April 2015 Yubin Lim.
Computing Eigen Information for Small Matrices The eigen equation can be rearranged as follows: Ax = x  Ax = I n x  Ax - I n x = 0  (A - I n )x = 0.
Identification of cell cycle-related regulatory motifs using a kernel canonical correlation analysis Presented by Rhee, Je-Keun Graduate Program in Bioinformatics.
SVD Singular Value Decomposition Wolf Dan. 2 Outline Short reminder – the analyzing process Mathematical definition of the SVD How do we use SVD analysis.
Blind Information Processing: Microarray Data Hyejin Kim, Dukhee KimSeungjin Choi Department of Computer Science and Engineering, Department of Chemical.
Introduction to Linear Algebra Mark Goldman Emily Mackevicius.
Principal Component Analysis Zelin Jia Shengbin Lin 10/20/2015.
Instructor: Mircea Nicolescu Lecture 8 CS 485 / 685 Computer Vision.
Learning Kernel Classifiers 1. Introduction Summarized by In-Hee Lee.
Initial conditions for N-body simulations Hans A. Winther ITA, University of Oslo.
MRA (from subdivision viewpoint) Jyun-Ming Chen Spring 2001.
Information Processing by Neuronal Populations Chapter 6: Single-neuron and ensemble contributions to decoding simultaneously recoded spike trains Information.
PRESENT BY BING-HSIOU SUNG A Multilinear Singular Value Decomposition.
Chapter 61 Chapter 7 Review of Matrix Methods Including: Eigen Vectors, Eigen Values, Principle Components, Singular Value Decomposition.
Linear Algebra Review Tuesday, September 7, 2010.
Principal Components Analysis ( PCA)
Central limit theorem - go to web applet. Correlation maps vs. regression maps PNA is a time series of fluctuations in 500 mb heights PNA = 0.25 *
Unsupervised Learning II Feature Extraction
Unsupervised Learning II Feature Extraction
EMPIRICAL ORTHOGONAL FUNCTIONS 2 different modes SabrinaKrista Gisselle Lauren.
CS246 Linear Algebra Review. A Brief Review of Linear Algebra Vector and a list of numbers Addition Scalar multiplication Dot product Dot product as a.
Introduction to Vectors and Matrices
Review of Linear Algebra
Fig. 2 Two-dimensional embedding result obtained using nMDS.
Matrices Definition: A matrix is a rectangular array of numbers or symbolic elements In many applications, the rows of a matrix will represent individuals.
Feature Space Based Watermarking in Multi-Images
Principal Component Analysis
Subhayu Basu et al. , DNA8, (2002) MEC Seminar Su Dong Kim
Biointelligence Laboratory, Seoul National University
Introduction to Connectivity Analyses
Outline Singular Value Decomposition Example of PCA: Eigenfaces.
Comparison of EET and Rank Pooling on UCF101 (split 1)
Introduction to Vectors and Matrices
Principal Component Analysis
Restructuring Sparse High Dimensional Data for Effective Retrieval
Latent Semantic Analysis
Presentation transcript:

Dynamic modeling of gene expression data Neal S. Holter et. al. PNAS vol.98, No4, 2000 Summarized by Jinsan Yang

2002 SNU Biointelligence Lab. Introduction Assumptions The expression levels of genes at a given time are postulated to be linear combinations of their levels of at a previous time Common Goals Describe the time evolution of gene expression levels by using a time translation matrix Derive the time translation matrix by using characteristic modes of SVD (singular value decomposition) 2002 SNU Biointelligence Lab.

2002 SNU Biointelligence Lab. Introduction Notes Time translation matrix reflects the magnitude of the connectivities between genes The number of genes g far exceeds the number of time points (g x g equations are needed – ill posed problem) Nonlinear interpolation scheme: speculative Clustering genes: not clear In this paper: Use characteristic modes from SVD The casual links between the modes (hence for genes) involve just a few essential connections and any additional connections are redundant 2002 SNU Biointelligence Lab.

Singular Value Decomposition (SVD) Singular value decomposition of an (n x m) matrix A : m: gene number, n: time interval number columns of U (gene coefficient vectors) form a orthogonal basis for the gene space a columns of V (gene expression vectors) form an orthogonal basis for the expression (array) space 2002 SNU Biointelligence Lab.

Singular Value Decomposition (SVD) Calculating SVD of A : from the eigen values of AA and AA Smaller singular values correspond to ‘noise’, larger singular values correspond to principle directions in the data The columns of U are determined by The vectors (modes) are the first r rows of the matrix where each column corresponds to the times at which the corresponding expression data are measured The temporal variation of any gene j can be written exactly as a linear combination of these r characteristic modes as 2002 SNU Biointelligence Lab.

Singular Value Decomposition (SVD) For any gene j : The contribution of the first k modes to the temporal pattern of a gene j and its average over all genes : 2002 SNU Biointelligence Lab.

Singular Value Decomposition (SVD) 2002 SNU Biointelligence Lab.

2002 SNU Biointelligence Lab. Method Expression levels of r modes at time t : The linear model: The gene expression data set can be reexpressed precisely by using the r specific coefficients for each gene, M, and the initial values of each of the r modes. M can be derived by using simulated annealing methods M(i,j) describes the influence of mode j on mode i M(i,j) multiplied by the expression level of gene j at time t contributes to the expression level of gene i at time (t+t ) 2002 SNU Biointelligence Lab.

2002 SNU Biointelligence Lab. Results Datasets: yeast cell cycle (CDC15) (15) yeast sporulation (7) Human fibroblast (13) Reducing the number of modes: from the matrix M from r clusters of genes: six clusters of yeast sporulation data: Metabolic, early I, early II, middle, midlate, late 2002 SNU Biointelligence Lab.

2002 SNU Biointelligence Lab. Results The expression of 6 clusters by six modes: The interrelationships between the cluster expression patterns: measured (o) and calculated (-) expression profiles 2002 SNU Biointelligence Lab.

2002 SNU Biointelligence Lab. Results For (2 x 2) M matrix from the matrix M using 2 most important modes a: 2x2 translation matrix from initial values b: linear combination of 2 modes c: original data 2002 SNU Biointelligence Lab.

2002 SNU Biointelligence Lab. Results 2002 SNU Biointelligence Lab.

2002 SNU Biointelligence Lab. Results 2002 SNU Biointelligence Lab.