Preliminary Exploratory Data Analysis for Ruth’s G2P Workflow Bernice Rogowitz 4/30/2010.

Slides:



Advertisements
Similar presentations
Data - Frequency Tables and Line Plots Joseph Williams Modified by Charlotte Stripling M7D1.b Construct frequency Distributions Objective: Create and interpret.
Advertisements

VC.04 Vector Fields Acting on a Curve. Example 1: What is a Vector Field?
Asking translational research questions using ontology enrichment analysis Nigam Shah
Human growth is associated with distinct patterns of gene expression in evolutionarily conserved networks Adam Stevens, Daniel Hanson, Andrew Whatmore,
Dahlia Nielsen North Carolina State University Bioinformatics Research Center.
COLORING WITH NUMBERS. NumbersNumbers NumbersNumbers
Multiple testing adjustments European Molecular Biology Laboratory Predoc Bioinformatics Course 17 th Nov 2009 Tim Massingham,
© red ©
Geographic Information Systems GIS Output. 1. Color Theory Additive primaries blue, green, and red Subtractive primaries yellow, cyan, and magenta.
Using visualization and network analysis to assist function analysis of microarray data Hepatitis C Virus (HCV) Micorarray Data Function Analysis Current.
Introduction to Chi-Square Procedures March 11, 2010.
Scaffold Download free viewer:
Review of Ondex Bernice Rogowitz G2P Visualization and Visual Analytics Team March 18, 2010.
EnrichNet: network-based gene set enrichment analysis Presenter: Lu Liu.
Color. -Visual light -An integral part of the sculpture -Creates desired effect -Distinguish items -Strengthen interest.
DNA microarray technology allows an individual to rapidly and quantitatively measure the expression levels of thousands of genes in a biological sample.
University of Michigan Medical School 1 Towards a Semantic Web application: Ontology-driven ortholog clustering analysis Yu Lin, Zuoshuang Xiang, Yongqun.
Geographic Information Systems GIS Output. 1. Color Theory ► Additive primaries blue, green, and red ► Subtractive primaries yellow, cyan, and magenta.
Test of Homogeneity Lecture 45 Section 14.4 Wed, Apr 19, 2006.
“My Experiment” and What I Want to Discover My experiment involved comparing the effect of long term ozone exposure on gene expression in a wild type and.
Chi-Square Test James A. Pershing, Ph.D. Indiana University.
If you say 8 color the ones in your picture purple. If you say 9 color the ones in your picture blue.
1 ArrayTrack Demonstration National Center for Toxicological Research U.S. Food and Drug Administration 3900 NCTR Road, Jefferson, AR
COLOR AFFECTING MEMORY Emily Hanwell Bellwood Antis School District/ Grade 9.
FRACTIONS & SHAPES BY:. How many of these are colored red? * out of *.
Slide 12.1 Saunders, Lewis and Thornhill, Research Methods for Business Students, 5 th Edition, © Mark Saunders, Philip Lewis and Adrian Thornhill 2009.
Color Distribution A block BrownYellowOrangeRedGreenBlue GreenYellowOrangeRedPurple M&M Skittles.
A red dress What’s this? It’s a dress. What colour is it? It’s red. It’s a red dress.
EXAMPLE 3 Standardized Test Practice SOLUTION The theoretical probability of stopping on each of the four colors is.Use the outcomes in the table to find.
Micro array Data Analysis. Differential Gene Expression Analysis The Experiment Micro-array experiment measures gene expression in Rats (>5000 genes).
 Ms. Amy Wilensky.  The slides of this PowerPoint do not go in order. Click the navigation symbols to move through the activity.  Click to move back.
Chi Square Test of Homogeneity. Are the different types of M&M’s distributed the same across the different colors? PlainPeanutPeanut Butter Crispy Brown7447.
Statistical Analysis Chi Square (X2).
Chi-Square X2.
Volume 8, Issue 4, Pages (April 2017)
CS320n –Visual Programming
Schematic representation of investigation into microbial diversity, interactions, and metabolic activity using a combined meta-omic and enrichment culture.
Name: _______________________________
Fig. 4. BET bromodomain inhibition suppresses transactivation of shared transcriptional networks across HF models. BET bromodomain inhibition suppresses.
Two ways to discuss color 1) Addition 2) Subtraction
UNIT3 Colour.
Can I color yellow?. Can I color yellow?
LR LS SR SS RR RS Cluster T7 Cluster T6 Cluster T4 Cluster T1
Analysis of total and polysomal RNA clk-1(qm30) and clk-1(qm30)+WT.
Integration of omic networks in a developmental atlas of maize
Transcriptional Landscape of Cardiomyocyte Maturation
The Translational Landscape of the Mammalian Cell Cycle
What Color is it?.
Construction and analysis of the HEV-host PPI network.
Volume 8, Issue 4, Pages (April 2017)
Øyvind H. Lie et al. JACEP 2018;j.jacep
Parallel evolution of coding sequences above neutral expectation.
Cluster analysis and pathway-based characterization of differentially expressed genes and proteins from integrated proteomics. Cluster analysis and pathway-based.
Matthieu Foll, Oscar E. Gaggiotti, Josephine T
Multiple filtration to treat the putative novel peptides.
Functional interaction network based on the 234 differentially expressed genes. a) Functional network with annotated gene clusters. Functional interaction.
Volume 37, Issue 6, Pages (December 2012)
Comparison of proteomics and RNA‐Seq data.
Transcriptome analyses of four regions of the mouse brain.
Volume 4, Issue 1, Pages (January 2011)
Volume 42, Issue 6, Pages (June 2011)
Differential protein, mRNA, lncRNA and miRNA regulation by p53.
Color Box Button - Gray Type : object Type : object Type : object
DES alters uterine gene expression primarily in the epithelium.
Statistical chart of significantly differentially expressed genes
CD4+CLA+CD103+ T cells from human blood and skin share a transcriptional profile. CD4+CLA+CD103+ T cells from human blood and skin share a transcriptional.
Genome-wide Functional Analysis Reveals Factors Needed at the Transition Steps of Induced Reprogramming  Chao-Shun Yang, Kung-Yen Chang, Tariq M. Rana 
Host gene expression changes following exposure to the microbiota.
Volume 25, Issue 5, Pages e4 (May 2017)
Geographic Information Systems
Presentation transcript:

Preliminary Exploratory Data Analysis for Ruth’s G2P Workflow Bernice Rogowitz 4/30/2010

Exploratory Data Analysis Exercise A look at Bjorn’s experimental data with Lecong’s Lemma analysis Main focus: to demonstrate potential for exploratory data analysis techniques for G2P visualization –and to demonstrate ViVA capabilities Preliminary identification of “interesting” clusters of genes, which can feed additional analyses of pathway and metabolic activity This type of analysis can pave the way to creating an exploratory analysis component based on ViVA for integration into Ruth’s workflow

Experimental Data 15,085 genes 2 Experiments –Long Exposure –Short Exposure 5 Temperatures for each -- 10,12, 14, 17, 8 degrees Celsius Control Condition for each Experiment, 20 degrees Celsius Data –Gene expression values –Statistical significance, relative to control condition A quick look at expression value distributions

Distribution of p values for short and long experiments Notice: lots of highly significant values, p<=.05

Exploratory data analysis Result: Green = all those genes that are significant in all the short conditions. Note, none significantly different from the control in the warmest condition, 17 degrees Celsius. For each condition, mark values where p<=0.5 in green For each condition, mark values where p>=0.5 in in black

Identify genes by ID in “Category Table”

Genes that are significantly different from controls across14C, 12C, 10C and 8C Short Experiment

Genes that are significantly different from controls across14C, 12C, 10C and 8C Long Experiment The “yellow” genes are significantly different from controls in both the short and long experiments

Genes that are significantly different from controls across 14C, 12C, 10C and 8C Short Experiment

Two genes At 17C, there were no genes that behaved different from the control. Two genes were significantly different from their controls in all other experimental conditions –In the long- and short-duration experiments –For temperature = 14C, 12C, 10C and 8C These are: –246114_at –257252_at

Another Analysis Identify Genes that are differentially expressed in the different experiments and conditions First, identify genes that are visually different from controls. Second, filter out identified genes that are not statistically different from the controls.

Expression vs. Control Short Experiment Long Experiment

Visual Exploration: Short Condition 1.Visually identified genes in short-14 condition that were different from controls. Color them red. 2.Examine short- 8 condition. Additional genes identified in short-8, which were visually different from controls, color them green.

Visual Exploration: Short Condition 1.Genes that are active in the short-14 condition tend to be more highly differentiated in lower temperature, short-8 (red) 2.Additional genes are differentiated at the lower temperature (green)

Visual Exploration: Long Condition 1. Some red and green genes are also visually differentiated in the long condition 2.Additional genes are visually differentiated, which are not green or red. Color them blue.

Visual Exploration: Return to the Short Condition 1.Genes that are differentiated in the long condition (blue) are basically not differentiated in the short condition (no blue genes are present)

S-14 S-8 L-8

“Blue” Genes 28 genes identified which are very different from the control for the long-8 condition, but not for the short conditions. Next: which of these are significantly different?

ViVA “Category Table” Table sorted by p-value”. Not all “blue” genes significantly different from the control. To see which ones are, look at the blue genes only.

Identifying p<=0.05 deviations P<= 0.05 P<=.01

Genes that are significantly expressed for long, but not short duration P<=0.05 P<=0.01

Goal: Venn Diagram of Co- Expression (a la Ruth) 2 Short 14 Long 14 Long 8Short 8

Next step Translate gene ids to searchable format Use PlantMetGen and MapMan to identify involved pathways and metabolic functions