Multidimensional Analysis If you are comparing more than two conditions (for example 10 types of cancer) or if you are looking at a time series (cell cycle.

Slides:



Advertisements
Similar presentations
Applications of one-class classification
Advertisements

Principal Component Analysis Based on L1-Norm Maximization Nojun Kwak IEEE Transactions on Pattern Analysis and Machine Intelligence, 2008.
Principal Component Analysis (PCA) for Clustering Gene Expression Data K. Y. Yeung and W. L. Ruzzo.
Correlation Aware Feature Selection Annalisa Barla Cesare Furlanello Giuseppe Jurman Stefano Merler Silvano Paoli Berlin – 8/10/2005.
COMPUTER AIDED DIAGNOSIS: FEATURE SELECTION Prof. Yasser Mostafa Kadah –
São Paulo Advanced School of Computing (SP-ASC’10). São Paulo, Brazil, July 12-17, 2010 Looking at People Using Partial Least Squares William Robson Schwartz.
Multivariate Methods Pattern Recognition and Hypothesis Testing.
Principal Component Analysis
Dimension reduction : PCA and Clustering Agnieszka S. Juncker Slides: Christopher Workman and Agnieszka S. Juncker Center for Biological Sequence Analysis.
Unsupervised Learning - PCA The neural approach->PCA; SVD; kernel PCA Hertz chapter 8 Presentation based on Touretzky + various additions.
Classification: Support Vector Machine 10/10/07. What hyperplane (line) can separate the two classes of data?
Classification of Microarray Data. Sample Preparation Hybridization Array design Probe design Question Experimental Design Buy Chip/Array Statistical.
CONTENT BASED FACE RECOGNITION Ankur Jain 01D05007 Pranshu Sharma Prashant Baronia 01D05005 Swapnil Zarekar 01D05001 Under the guidance of Prof.
Prénom Nom Document Analysis: Data Analysis and Clustering Prof. Rolf Ingold, University of Fribourg Master course, spring semester 2008.
Dimension reduction : PCA and Clustering Slides by Agnieszka Juncker and Chris Workman.
Introduction to BioInformatics GCB/CIS535
Classification of Microarray Data. Sample Preparation Hybridization Array design Probe design Question Experimental Design Buy Chip/Array Statistical.
Dimension reduction : PCA and Clustering Christopher Workman Center for Biological Sequence Analysis DTU.
Bayesian belief networks 2. PCA and ICA
Lecture #1COMP 527 Pattern Recognition1 Pattern Recognition Why? To provide machines with perception & cognition capabilities so that they could interact.
1 Masterseminar „A statistical framework for the diagnostic of meningioma cancer“ Chair for Bioinformatics, Saarland University Andreas Keller Supervised.
JAVED KHAN ET AL. NATURE MEDICINE – Volume 7 – Number 6 – JUNE 2001
Eigenfaces for Recognition Student: Yikun Jiang Professor: Brendan Morris.
Principal Component Analysis (PCA) for Clustering Gene Expression Data K. Y. Yeung and W. L. Ruzzo.
Gene expression profiling identifies molecular subtypes of gliomas
嵌入式視覺 Pattern Recognition for Embedded Vision Template matching Statistical / Structural Pattern Recognition Neural networks.
Breast Cancer Diagnosis via Linear Hyper-plane Classifier Presented by Joseph Maalouf December 14, 2001 December 14, 2001.
This week: overview on pattern recognition (related to machine learning)
A systems biology approach to the identification and analysis of transcriptional regulatory networks in osteocytes Angela K. Dean, Stephen E. Harris, Jianhua.
Sample classification using Microarray Data. AB We have two sample entities malignant vs. benign tumor patient responding to drug vs. patient resistant.
PCA, Clustering and Classification by Agnieszka S. Juncker Part of the slides is adapted from Chris Workman.
A Short Overview of Microarrays Tex Thompson Spring 2005.
Class Prediction and Discovery Using Gene Expression Data Donna K. Slonim, Pablo Tamayo, Jill P. Mesirov, Todd R. Golub, Eric S. Lander 발표자 : 이인희.
1 Pattern Recognition: Statistical and Neural Lonnie C. Ludeman Lecture 24 Nov 2, 2005 Nanjing University of Science & Technology.
Dimension reduction : PCA and Clustering Slides by Agnieszka Juncker and Chris Workman modified by Hanne Jarmer.
STATISTICS FOR HIGH DIMENSIONAL BIOLOGICAL RECORDINGS Dr Cyril Pernet, Centre for Clinical Brain Sciences Brain Research Imaging Centre
Nuria Lopez-Bigas Methods and tools in functional genomics (microarrays) BCO17.
Data Mining: Knowledge Discovery in Databases Peter van der Putten ALP Group, LIACS Pre-University College Bio Informatics January
Flat clustering approaches
Fast Query-Optimized Kernel Machine Classification Via Incremental Approximate Nearest Support Vectors by Dennis DeCoste and Dominic Mazzoni International.
Case Study: Characterizing Diseased States from Expression/Regulation Data Tuck et al., BMC Bioinformatics, 2006.
2D-LDA: A statistical linear discriminant analysis for image matrix
The CALMA project A CAD tool in breast radiography A.Ceccopieri, Padova
ECE 8443 – Pattern Recognition ECE 8527 – Introduction to Machine Learning and Pattern Recognition LECTURE 10: PRINCIPAL COMPONENTS ANALYSIS Objectives:
Eigengenes as biological signatures Dr. Habil Zare, PhD PI of Oncinfo Lab Assistant Professor, Department of Computer Science Texas State University 3.
MicroArray Data Analysis Candice Quadros & Amol Kothari.
Knowledge Discovery in a DBMS Data Mining Computing models and finding patterns in large databases current major challenge in database systems & large.
Unsupervised Learning
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
School of Computer Science & Engineering
Gene Set Enrichment Analysis
Dimension reduction : PCA and Clustering by Agnieszka S. Juncker
Mixture of SVMs for Face Class Modeling
Basic machine learning background with Python scikit-learn
Outline Peter N. Belhumeur, Joao P. Hespanha, and David J. Kriegman, “Eigenfaces vs. Fisherfaces: Recognition Using Class Specific Linear Projection,”
Blind Signal Separation using Principal Components Analysis
Albert Xue, Binbin Huang, Jianrong Wang
PCA, Clustering and Classification by Agnieszka S. Juncker
Analysis of GO annotation at cluster level by Agnieszka S. Juncker
Bayesian belief networks 2. PCA and ICA
Principal Component Analysis
Dimension reduction : PCA and Clustering
CS4670: Intro to Computer Vision
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
Face Recognition: A Convolutional Neural Network Approach
Machine Learning – a Probabilistic Perspective
Review of Statistical Pattern Recognition
Unsupervised Learning
What is Artificial Intelligence?
Presentation transcript:

Multidimensional Analysis If you are comparing more than two conditions (for example 10 types of cancer) or if you are looking at a time series (cell cycle or progression of cancer) you are looking at a multidimensional problem

Example: 6000 genes in 10 patients 6000 points in 10- dimensional space (gene view) 10 points in dimensional space (patient view) Principal Component Analysis (PCA) Clustering Correspondence Analysis Reduction of dimensions :

Patient view

Classification 1: patients surviving 5 years after breast cancer surgery 2: patients dead within 5 years of breast cancer surgery

Other classifiers Neural Networks Support Vector Machines Other classifiers from statistical literature

Issues in building a classifier Feature selection: a selected group of genes may be optimal (t-test) Independent validation: you must test the classifier on samples that were not used for feature selection or for building the classifier (training set - test set or leave-one- out crossvalidation)

Promoter Analysis Genes that pass the significance test are clustered and their corresponding promoter regions extracted. Regions are searched for potential transcription factor binding sites that they have in common Saco-patterns looks for exactly identical patterns Gibbs sampler allows for degeneracy of patterns with weight matrix description Transfac is a database of known transcription factor binding sites. Patterns can be assessed based on overrepresentation in cluster relative to background set.