Using visualization and network analysis to assist function analysis of microarray data Hepatitis C Virus (HCV) Micorarray Data Function Analysis Current.

Slides:



Advertisements
Similar presentations
Gene Set Enrichment Analysis Genome 559: Introduction to Statistical and Computational Genomics Elhanan Borenstein.
Advertisements

Original Figures for "Molecular Classification of Cancer: Class Discovery and Class Prediction by Gene Expression Monitoring"
13:10:58 A New Tool for Mapping Microarray Data onto the Gene Ontology Structure ( Abstract e GOn (explore Gene Ontology) is a.
Gene Set Enrichment Analysis Genome 559: Introduction to Statistical and Computational Genomics Elhanan Borenstein.
Global Mapping of the Yeast Genetic Interaction Network Tong et. al, Science, Feb 2004 Presented by Bowen Cui.
Detecting Differentially Expressed Genes Pengyu Hong 09/13/2005.
Figure 1: (A) A microarray may contain thousands of ‘spots’. Each spot contains many copies of the same DNA sequence that uniquely represents a gene from.
Gene Set Analysis 09/24/07. From individual gene to gene sets Finding a list of differentially expressed genes is only the starting point. Suppose we.
Gene Expression 1. Methods –Unsupervised Clustering Hierarchical clustering K-means clustering Expression data –GEO –UCSC EPCLUST 2.
Tutorial 8 Clustering 1. General Methods –Unsupervised Clustering Hierarchical clustering K-means clustering Expression data –GEO –UCSC –ArrayExpress.
Supplementary Table 1 Table S1. Population frequency of HLA -A, -B and -C alleles. Rare alleles (frequency < 0.5%) are highlighted by a grey background.
DNA microarray technology allows an individual to rapidly and quantitatively measure the expression levels of thousands of genes in a biological sample.
Jesse Gillis 1 and Paul Pavlidis 2 1. Department of Psychiatry and Centre for High-Throughput Biology University of British Columbia, Vancouver, BC Canada.
Networks and Interactions Boo Virk v1.0.
Applications of Biological Network Presented and Created By : Harshit Bhatt.
Supplement 1 PCA analysis of metabolite data from untreated rice genotypes by LC-TOF and GC-TOF mass spectrometry. Top panel: (A) LC-TOF analysis of Xa21.
Scenario 6 Distinguishing different types of leukemia to target treatment.
S factor  body weight during WM  body weight during ER Supplementary figure 1: S factor distribution in each diet. S.
Bioinformatics Expression profiling and functional genomics Part II: Differential expression Ad 27/11/2006.
Tutorial 7 Gene expression analysis 1. Expression data –GEO –UCSC –ArrayExpress General clustering methods –Unsupervised Clustering Hierarchical clustering.
Tutorial # 9 – Q#6.16 from textbook Nov. 21,
Drug–Target Network Hussein Hijazi Fall 2012 Michigan State University 1.
Using geWorkbench: Working with Sets of Data Fan Lin, Ph. D. Molecular Analysis Tools Knowledge Center Columbia University and The Broad Institute of MIT.
Advanced Gene Selection Algorithms Designed for Microarray Datasets Limitation of current feature selection methods: –Ignores gene/gene interaction: single.
NCode TM miRNA Analysis Platform Identifies Differentially Expressed Novel miRNAs in Adenocarcinoma Using Clinical Human Samples Provided By BioServe.
Supplementary Figure 1 A B C.
Biological Mechanism Profiling Using an Annotated Compound Library
Volume 44, Issue 1, Pages (January 2016)
Figure 1. Complete work-flow of the Scasat
The Functional Impact of Alternative Splicing in Cancer
A bioinformatic analysis of microRNAs role in osteoarthritis
(A) Graph depicting interaction between the five regulatory motifs measured by synergy and cooccurrence. (A) Graph depicting interaction between the five.
Total-Genome Analysis of BRCA1/2-Related Invasive Carcinomas of the Breast Identifies Tumor Stroma as Potential Landscaper for Neoplastic Initiation 
Volume 137, Issue 2, Pages (August 2009)
Network analyses of differentially expressed proteins in amniotic fluid supernatant associated with abnormal human karyotypes  Tzu-Hao Wang, M.D., Ph.D.,
Volume 7, Issue 4, Pages (April 2005)
Volume 13, Issue 9, Pages (December 2015)
Volume 23, Issue 4, Pages (April 2018)
A Call for Systematic Research on Solute Carriers
Genomic characterization of the inflammatory response initiated by surgical intervention and the effect of perioperative cyclooxygenase 2 blockade  Keith.
The Functional Impact of Alternative Splicing in Cancer
Target-Specific Precision of CRISPR-Mediated Genome Editing
Volume 3, Issue 1, Pages (July 2016)
RNA-Stabilized Whole Blood Samples but Not Peripheral Blood Mononuclear Cells Can Be Stored for Prolonged Time Periods Prior to Transcriptome Analysis 
Annabelle C. Singer, Loren M. Frank  Neuron 
Towfique Raj, Manik Kuchroo, Joseph M
Volume 97, Issue 3, Pages e8 (February 2018)
Volume 88, Issue 3, Pages (November 2015)
Statistical comparison of metabolites and analysis of differential metabolites and key metabolic pathways. Statistical comparison of metabolites and analysis.
Analysis of Microarray Data Using Z Score Transformation
Volume 10, Issue 8, Pages (August 2017)
Exploring and Presenting Results
Effects of Long-Term Visual Experience on Responses of Distinct Classes of Single Units in Inferior Temporal Cortex  Luke Woloszyn, David L. Sheinberg 
Supplementary Figure 1 A B C.
Volume 37, Issue 6, Pages (December 2012)
Volume 7, Issue 3, Pages e12 (September 2018)
Reconstructing the hematopoietic hierarchy from micro‐clusters
Differential gene expression in whole blood from SJIA patients and healthy controls. A. Data were normalized in Beadstudio using the "average" method and.
Inhibitory Actions Unified by Network Integration
Biological Mechanism Profiling Using an Annotated Compound Library
Volume 122, Issue 6, Pages (September 2005)
Different Plant Hormones Regulate Similar Processes through Largely Nonoverlapping Transcriptional Responses  Jennifer L. Nemhauser, Fangxin Hong, Joanne.
Gene Density, Transcription, and Insulators Contribute to the Partition of the Drosophila Genome into Physical Domains  Chunhui Hou, Li Li, Zhaohui S.
Volume 7, Issue 2, Pages (August 2010)
Brandon Ho, Anastasia Baryshnikova, Grant W. Brown  Cell Systems 
Uncovering disease-disease relationships through the incomplete interactome by Jörg Menche, Amitabh Sharma, Maksim Kitsak, Susan Dina Ghiassian, Marc Vidal,
Volume 52, Issue 1, Pages (October 2013)
Relationship between blood cell and plasma miRNA expression among published circulating cancer biomarkers. Relationship between blood cell and plasma miRNA.
Modeling Endoplasmic Reticulum Network Maintenance in a Plant Cell
The long tail of mutational hotspots in cancer.
Presentation transcript:

Using visualization and network analysis to assist function analysis of microarray data Hepatitis C Virus (HCV) Micorarray Data Function Analysis Current approaches: Focus on identifying Gene Ontology (GO) terms that are over- represented in a set of genes of interest. Problems: Provide either too specific, or too general, level of biological information. Tend to neglect the diversity of biological function (prefer genes with one or a few GO terms). Focus on individual term, neglect the connections between them, and cannot explain the functions of a group of genes. Our approach: Analyzing in the context of a custom-build gene-function network. Data used: molecular pathogenesis of liver disease in hepatitis C virus (HCV) infection in humans. R.B.H. Williams 1, K. Xu 2, X.-X. Huang 1, C.J. Cotsapas 1, S.-H. Hong 2, G.W. McCaughan 3, M.D. Gorrell 3, P.F.R. Little 1 1. School of Biotechnology and Biomolecular Sciences, University of New South Wales, Australia. 2. VALACON project, National ICT Australia, Australia. 3. Centenary Institute of Cancer Medicine and Cell Biology, University of Sydney, Australia. Program title here in Gene Ontology Network Visualization and Analysis Microarray data Gene 1 Gene 2 Gene 3 … Gene n Candidate genes Analysis using Gene Ontology Gene Ontology term 1 Gene Ontology term 2 Gene Ontology term 3 … Gene Ontology term m Functions of interest Step 2: Statistical analysis We use the number of genes annotating to a GO term as the test statistic to select functions of interest when comparing against a set of background genes. This utilize the gene-term network structure rather than number of appearance. The results of this and next steps are shown in the figure below. 2-level gene-term networks for the 100 differentially expressed genes in HCV data. GO terms are green and genes are blue. GO terms that are highly annotated are shown in shades of red based on number of connections. Interrelationships between GO term degree, statistical significance of GO term degree and cluster membership in the 1-level gene-term network. Left panel: shows the ranked distribution of GO term degree (i.e. number of genes annotated to a term; closed black circles) and the bootstrap estimate P−value of observing the magnitude of degree compared to all genes expressed in the experiment (open white circles). GO terms with high significance are highlighted with a light grey banding (only terms with P < 0.01 are shown; no multiple comparison correction). Right panel: shows the membership of each of the terms in the top 6 clusters (ranked by number of genes and labeled c1– c6 ) detected by hierarchical clustering. Shading denotes the proportion of genes in each cluster that map to a term (black is 100% and white is 0%). Step 1: Network construction A k−level gene-term network is constructed according to the parameter k that specifies the level of abstraction: each gene is connected to the kth parent of its primary annotation. Also included are the connections among these terms in GO hierarchy. An example of 2-level network is shown below. Increasing the value of k results in the inclusion of higher level terms from the GO hierarchy. Step 3. Clustering Hierarchical clustering is used to identify groups of genes that had related functional annotations. The similarity of the GO annotations is used as the distance metrics between genes. The figure on the right is clusters 5 in the results shown in the figure above.