Clustering Gene Expression Data Using Independent Component Analysis

Slides:



Advertisements
Similar presentations
Michael R. Smith, Mark Clement, Tony Martinez, and Quinn Snell
Advertisements

University of Joensuu Dept. of Computer Science P.O. Box 111 FIN Joensuu Tel fax Gaussian Mixture.
Agent-based Modeling: A Brief Introduction Louis J. Gross The Institute for Environmental Modeling Departments of Ecology and Evolutionary Biology and.
State Estimation and Kalman Filtering CS B659 Spring 2013 Kris Hauser.
Image Modeling & Segmentation
1 Semi-supervised learning for protein classification Brian R. King Chittibabu Guda, Ph.D. Department of Computer Science University at Albany, SUNY Gen*NY*sis.
CSC321: 2011 Introduction to Neural Networks and Machine Learning Lecture 10: The Bayesian way to fit models Geoffrey Hinton.
Model-based clustering of gene expression data Ka Yee Yeung 1,Chris Fraley 2, Alejandro Murua 3, Adrian E. Raftery 2, and Walter L. Ruzzo 1 1 Department.
1 STARMAP: Project 2 Causal Modeling for Aquatic Resources Alix I Gitelman Stephen Jensen Statistics Department Oregon State University August 2003 Corvallis,
World Statistics Day Statisical Modelling of Complex Systems Jouko Lampinen Finnish Centre of Excellence in Computational Complex Systems Research.
Principal Component Analysis
First introduced in 1977 Lots of mathematical derivation Problem : given a set of data (data is incomplete or having missing values). Goal : assume the.
CS 547: Sensing and Planning in Robotics Gaurav S. Sukhatme Computer Science Robotic Embedded Systems Laboratory University of Southern California
UNIVERSITY OF JYVÄSKYLÄ Topology Management in Unstructured P2P Networks Using Neural Networks Presentation for IEEE Congress on Evolutionary Computing.
Classification for High Dimensional Problems Using Bayesian Neural Networks and Dirichlet Diffusion Trees Radford M. Neal and Jianguo Zhang the winners.
1 Accounting for Spatial Dependence in Bayesian Belief Networks Alix I Gitelman Statistics Department Oregon State University August 2003 JSM, San Francisco.
Region Based Image Annotation Through Multiple-Instance Learning By: Changbo Yang Wayne State University Department of Computer Science.
ICA-based Clustering of Genes from Microarray Expression Data Su-In Lee 1, Serafim Batzoglou 2 1 Department.
Bayesian Analysis for Extreme Events Pao-Shin Chu and Xin Zhao Department of Meteorology School of Ocean & Earth Science & Technology University of Hawaii-
Combining the strengths of UMIST and The Victoria University of Manchester Propagating Measurement Uncertainty in Microarray Data Analysis Magnus Rattray.
A Unifying Review of Linear Gaussian Models
Learning In Bayesian Networks. Learning Problem Set of random variables X = {W, X, Y, Z, …} Training set D = { x 1, x 2, …, x N }  Each observation specifies.
CSC2535: 2013 Advanced Machine Learning Lecture 3a: The Origin of Variational Bayes Geoffrey Hinton.
Computer vision: models, learning and inference Chapter 6 Learning and Inference in Vision.
Tennessee Technological University1 The Scientific Importance of Big Data Xia Li Tennessee Technological University.
Anomaly detection with Bayesian networks Website: John Sandiford.
Using Bayesian Networks to Analyze Expression Data N. Friedman, M. Linial, I. Nachman, D. Hebrew University.
Genetic Regulatory Network Inference Russell Schwartz Department of Biological Sciences Carnegie Mellon University.
Mining and Visualization of Flow Cytometry Data ANGELA CHIN UNIVERSITY OF HOUSTON RESEARCH EXPERIENCE FOR UNDERGRADUATES JULY 3,
University of Southern California Department Computer Science Bayesian Logistic Regression Model (Final Report) Graduate Student Teawon Han Professor Schweighofer,
Adaptive Median filtering of Still Images Arjun Arunachalam Shyam Bharat Department of Electrical Engineering.
Clustering Features in High-Throughput Proteomic Data Richard Pelikan (or what’s left of him) BIOINF 2054 April
Blind Information Processing: Microarray Data Hyejin Kim, Dukhee KimSeungjin Choi Department of Computer Science and Engineering, Department of Chemical.
Cluster Analysis Potyó László. Cluster: a collection of data objects Similar to one another within the same cluster Similar to one another within the.
IB Computer Science – Logic
Feature Selection in k-Median Clustering Olvi Mangasarian and Edward Wild University of Wisconsin - Madison.
Distributed Spanning Tree Center for Information Security Department of Computer Science University of Tulsa, Tulsa, OK
Cluster Analysis Data Mining Experiment Department of Computer Science Shenzhen Graduate School Harbin Institute of Technology.
CSC2535: Computation in Neural Networks Lecture 7: Independent Components Analysis Geoffrey Hinton.
Identifying submodules of cellular regulatory networks Guido Sanguinetti Joint work with N.D. Lawrence and M. Rattray.
Introduction to Machine Learning Nir Ailon Lecture 12: EM, Clustering and More.
A Brief Introduction to Bayesian networks
Bayesian Semi-Parametric Multiple Shrinkage
Denver County Medicaid Caseload March
A knowledge-based text annotation tool
Variational Bayes Model Selection for Mixture Distribution
Principal Component Analysis (PCA)
Number Talks: Developing Computational Fluency
Inconsistent Constraints
William Norris Professor and Head, Department of Computer Science
Contact Graduate Program Director: More information at website:
Contact Graduate Program Director: More information at website:
FUNDAMENTALS OF MACHINE LEARNING AND DEEP LEARNING
CSCI 5822 Probabilistic Models of Human and Machine Learning
Hidden Markov Models Part 2: Algorithms
William Norris Professor and Head, Department of Computer Science
A Short Tutorial on Causal Network Modeling and Discovery
Lecture 14 PCA, pPCA, ICA.
Collapsed Variational Dirichlet Process Mixture Models
Department of Computer Science University of York
CSCI 5822 Probabilistic Models of Human and Machine Learning
COMPUTATIONAL PROCESS REPRESENTATION IN A KNOWLEDGE BASE
MATH 6380J Mini-Project 1: Realization of Recent Trends in Machine Learning Community in Recent Years by Pattern Mining of NIPS Words Chan Lok Chun
Discovery of Hidden Structure in High-Dimensional Data
Probabilistic Modelling of Brain Imaging Data
Learning Incoherent Sparse and Low-Rank Patterns from Multiple Tasks
Restructuring Sparse High Dimensional Data for Effective Retrieval
Lecture 15 Factor Analysis.
Beehive Audio Source Separation
Uncertainty Propagation
Presentation transcript:

Clustering Gene Expression Data Using Independent Component Analysis Stephen C. Billups University of Colorado at Denver Department of Mathematics Larry Hunter University of Colorado Health Sciences Center Department of Pharmacology

x=As + ν + noise

Key Points ICA clustering attractive for gene expression data: Accounts for and identifies independent hidden effects that influence gene expression. Allows clusters with markedly different shapes and dimensionalities to be identified. Bayesian approach allows prior knowledge to be incorporated. (semi-supervised learning). Algorithm works only when underlying effects have non-Gaussian distributions. The algorithm is made tractable by using a variational Bayesian method with some sensible simplifications. Opportunities exist for improving the computational efficiency.