Dr Paul Lewis Lecturer in Bioinformatics Lecturer in Bioinformatics Cardiff University Cardiff University Biostatistics & Bioinformatics Unit Biostatistics.

Slides:



Advertisements
Similar presentations
Introduction to BioConductor Friday 23th nov 2007 Ståle Nygård Statistical methods and bioinformatics for the analysis of microarray.
Advertisements

Principal Component Analysis (PCA) for Clustering Gene Expression Data K. Y. Yeung and W. L. Ruzzo.
Abstract BarleyBase ( is a USDA-funded public repository for plant microarray data. BarleyBase houses raw and normalized expression.
Multivariate Methods Pattern Recognition and Hypothesis Testing.
Statistics Tools in GeneSpring The Center for Bioinformatics UNC at Chapel Hill Jianping Jin Ph.D. Bioinformatics Scientist Phone: (919)
DNA Microarray Bioinformatics - #27611 Program Normalization exercise (from last week) Dimension reduction theory (PCA/Clustering) Dimension reduction.
Microarray GEO – Microarray sets database
Microarray Data Preprocessing and Clustering Analysis
Gene Expression Data Analyses (3)
Introduction to Computational Biology Topics. Molecular Data Definition of data  DNA/RNA  Protein  Expression Basics of programming in Matlab  Vectors.
Kate Milova MolGen retreat March 24, Microarray experiments: Database and Analysis Tools. Kate Milova cDNA Microarray Facility March 24, 2005.
Dimension reduction : PCA and Clustering by Agnieszka S. Juncker
Dimension reduction : PCA and Clustering Slides by Agnieszka Juncker and Chris Workman.
Introduction to BioInformatics GCB/CIS535
Kate Milova MolGen retreat March 24, Microarray experiments. Database and Analysis Tools. Kate Milova cDNA Microarray Facility March 24, 2005.
Agilent: The Company, The Myth, The Lengend. Agilent: Agilent Technologies Inc. (NYSE: A) is a world-wide, diverse technology company focused on expansion.
GCB/CIS 535 Microarray Topics John Tobias November 8th, 2004.
Dimension reduction : PCA and Clustering Christopher Workman Center for Biological Sequence Analysis DTU.
Introduction to Bioinformatics - Tutorial no. 12
Demonstration Trupti Joshi Computer Science Department 317 Engineering Building North (O)
ViaLogy Lien Chung Jim Breaux, Ph.D. SoCalBSI 2004 “ Improvements to Microarray Analytical Methods and Development of Differential Expression Toolkit ”
Modeling Functional Genomics Datasets CVM Lesson 1 13 June 2007Bindu Nanduri.
Dimension reduction : PCA and Clustering by Agnieszka S. Juncker Part of the slides is adapted from Chris Workman.
ICA-based Clustering of Genes from Microarray Expression Data Su-In Lee 1, Serafim Batzoglou 2 1 Department.
Review of important points from the NCBI lectures. –Example slides Review the two types of microarray platforms. –Spotted arrays –Affymetrix Specific examples.
GCB/CIS 535 Microarray Topics John Tobias November 15 th, 2004.
Clustering and MDS Exploratory Data Analysis. Outline What may be hoped for by clustering What may be hoped for by clustering Representing differences.
Microarray Data Analysis Illumina Gene Expression Data Analysis Yun Lian.
Microarray Gene Expression Data Analysis A.Venkatesh CBBL Functional Genomics Chapter: 07.
Principal Component Analysis (PCA) for Clustering Gene Expression Data K. Y. Yeung and W. L. Ruzzo.
Practical Issues in Microarray Data Analysis Mark Reimers National Cancer Institute Bethesda Maryland.
Taking Raw Data Towards Analysis 1 iCSC2015, Vince Croft, NIKHEF Exploring EDA, Clustering and Data Preprocessing Lecture 2 Taking Raw Data Towards Analysis.
Classification (Supervised Clustering) Naomi Altman Nov '06.
Analysis and Management of Microarray Data Dr G. P. S. Raghava.
DNA microarray technology allows an individual to rapidly and quantitatively measure the expression levels of thousands of genes in a biological sample.
Introduction to DNA Microarray Technology Steen Knudsen Uma Chandran.
More on Microarrays Chitta Baral Arizona State University.
Abstract BarleyBase is a USDA-funded public repository for plant microarray data. BarleyBase houses raw and normalized expression data from the 22K Affymetrix.
BioQUEST / SCALE-IT Module From Omics Data to Knowledge Case 1: Microarrays Namyong Lee Minnesota State University, Mankato Matthew Macauley Clemson University.
Carlo Colantuoni – Summer Inst. Of Epidemiology and Biostatistics, 2009: Gene Expression Data Analysis 8:30am-12:00pm in Room W2017.
Gene expression analysis
Microarray data analysis David A. McClellan, Ph.D. Introduction to Bioinformatics Brigham Young University Dept. Integrative Biology.
Bioinformatics Expression profiling and functional genomics Part II: Differential expression Ad 27/11/2006.
A A R H U S U N I V E R S I T E T Faculty of Agricultural Sciences Introduction to analysis of microarray data David Edwards.
A Short Overview of Microarrays Tex Thompson Spring 2005.
Dimension reduction : PCA and Clustering Slides by Agnieszka Juncker and Chris Workman modified by Hanne Jarmer.
1 Course #412 Analyzing Microarray Data using the mAdb System April 1-2, :00 pm - 4:00pm Intended for users of the.
Evolutionary Algorithms for Finding Optimal Gene Sets in Micro array Prediction. J. M. Deutsch Presented by: Shruti Sharma.
EB3233 Bioinformatics Introduction to Bioinformatics.
Course Work Project Project title “Data Analysis Methods for Microarray Based Gene Expression Analysis” Sushil Kumar Singh (batch ) IBAB, Bangalore.
Nuria Lopez-Bigas Methods and tools in functional genomics (microarrays) BCO17.
GeWorkbench John Watkinson Columbia University. geWorkbench The bioinformatics platform of the National Center for the Multi-scale Analysis of Genomic.
1 ArrayTrack Demonstration National Center for Toxicological Research U.S. Food and Drug Administration 3900 NCTR Road, Jefferson, AR
Introduction to Microarrays Kellie J. Archer, Ph.D. Assistant Professor Department of Biostatistics
Microarray analysis Quantitation of Gene Expression Expression Data to Networks BIO520 BioinformaticsJim Lund Reading: Ch 16.
Analyzing Expression Data: Clustering and Stats Chapter 16.
A new initialization method for Fuzzy C-Means using Fuzzy Subtractive Clustering Thanh Le, Tom Altman University of Colorado Denver July 19, 2011.
GeWorkbench Overview Support Team Molecular Analysis Tools Knowledge Center Columbia University and The Broad Institute of MIT and Harvard.
Tutorial 8 Gene expression analysis 1. How to interpret an expression matrix Expression data DBs - GEO Clustering –Hierarchical clustering –K-means clustering.
Variability & Statistical Analysis of Microarray Data GCAT – Georgetown July 2004 Jo Hardin Pomona College
Statistical Analysis for Expression Experiments Heather Adams BeeSpace Doctoral Forum Thursday May 21, 2009.
Gist 2.3 John H. Phan MIBLab Summer Workshop June 28th, 2006.
Strategies for Metabolomic Data Analysis Dmitry Grapov, PhD.
Advanced Strategies for Metabolomic Data Analysis Dmitry Grapov, PhD.
Microarray Technology and Data Analysis Roy Williams PhD Sanford | Burnham Medical Research Institute.
Lecture 15 Wrap up of class. What we intended to do and what we have done: Topics: What is the Biological Problem at hand? Types of data: micro-array,
4.0 - Data Mining Sébastien Lemieux Elitra Canada Ltd.
David Amar, Tom Hait, and Ron Shamir
Dimension reduction : PCA and Clustering by Agnieszka S. Juncker
Dimension reduction : PCA and Clustering
Presentation transcript:

Dr Paul Lewis Lecturer in Bioinformatics Lecturer in Bioinformatics Cardiff University Cardiff University Biostatistics & Bioinformatics Unit Biostatistics & Bioinformatics Unit

Biostatistics & Bioinformatics Unit (BBU) Bioinformatics resource for Institutions across Wales Backing of the Higher Education Funding Council for Wales - £1.5 million grant through the Research Capacity Development Fund UWCM, Cardiff University, Aberystwyth 13 new posts in statistics & bioinformatics MSc/Postgraduate Diploma/Postgraduate Certificate:  Bioinformatics  Genetic Epidemiology and Bioinformatics

Brief Overview of Microarray Bioinformatics Introduce My Microarray Research Interests My Microarray Analysis Software

Experimental Design Differential Gene Expression Hybridisation Data Pattern Discovery Class Prediction Annotation Normalisation Bioinformatics in Microarray Experiment

Normalization Remove non-biological influences on data (systematic variation) 3 categories of Normalisation Normalisation – transform data to make more like a normal distribution log, lowess, linlog Standardisation – expand or contract distribution so data from different experiments can be compared calculate Z-scores Centralisation – move distribution so its centered around expected mean mean / median / mean trimmed centering

Experimental Design Differential Gene Expression Hybridisation Data Pattern Discovery Class Prediction Annotation Normalisation Bioinformatics in Microarray Experiment

With Replicates Parametric tests t-test (ANOVA) J. Comput. Biol : Bayesian t-test Bioinformatics : Mixture modelling & bootstrapping(SAM) P.N.A.S : Regression modelling Genome Res : All give similar results but SAM reduces false positives Non Parametric Tests Wilcoxon rank sum test Bioinformatics : Non-parametric t-test Bioinformatics : Ideal discriminator method Bioinformatics : low false positive rate but less power Find Differentially Expressed Genes Is fold change significant?

Experimental Design Differential Gene Expression Hybridisation Data Pattern Discovery Class Prediction Annotation Normalisation Bioinformatics in Microarray Experiment

Pattern Discovery & Class Prediction Explore how genes or samples group: Clustering Hierarchical Cluster AnalysisHIERARCHY K-Means Self Organising Maps (SOM)PARTITION Fuzzy ART Principal Components Analysis (PCA) Multidimensional Scaling (MDS)REDUCTION Correspondence Analysis (CoA) Assign genes to known groupings: Classification logistic regression neural networks linear discriminant analysis

Hierarchical Cluster Analysis

Partitioning Clustering Methods Need To Tell Methods Number of Clusters Genes Partitioned into Clusters What are Relationships Between Clusters? K-Means & SOM

2D & 3D Mapping Methods CoA MDS PCA Data Projected onto 2 or 3 Dimensions But….What are Cluster Boundaries?

Experimental Design Differential Gene Expression Hybridisation Data Pattern Discovery Class Prediction Annotation Normalisation Bioinformatics in Microarray Experiment

Online Tools: ARROGANT DAVID DRAGON EASE FANTOM GoMiner MatchMiner Onto-Express RESOURCERER Affymetrix GO Databases: Gene Ontology OMIM LocusLink UniGene LocusLink Annotation

My Research Interests Pattern Discovery Algorithm Development Biologist-Friendly Software Tools Take - 2D & 3D Mapping Methods Methods - Define Cluster Boundaries Make FUZZY EAS-IEAS-I 2D & 3D Visualisation Tools

Cluster Boundaries CoA MDS PCA

Fuzzy Clustering Differs to standard clust by assigning membership of a gene to all clusters Allows you to see the association of each gene within a cluster Can calculate the number of clusters in Partitioning methods (Fuzzy ART) Helps Combine Clusters Helps to clear Ambiguity

Fuzzy Mapping Add Membership values of each gene to clusters

Fuzzy Partitioning K-Means & SOM

Need for Comprehensive Pattern Discovery Software Suite Fuzzy Data Analysis Suite Visualisation Tools to explore data Easy to use Free Microarray Pattern Discovery BBUnit Web based version Service by BBU Increase traffic to BBU web site Establish BBU for microarray Cross platform

INTERFACE Normalisation Differential Gene Expression Pattern Discovery Utilities Log Normalise Mean Centre Median centre T test ANOVA Regression Hierarchical Cluster Analysis SOM K-Means Fuzzy Art PCA MDS CoA Fuzzy C-Means

Contact

Pete Kille Alan Clarke Gareth Hughes(EASI team) Karen Reed(Data) Lesley Jones(Data, & EASI Collaborator) BBU Acknowledgements