SPH 247 Statistical Analysis of Laboratory Data 1 May 12, 2015 SPH 247 Statistical Analysis of Laboratory Data.

Slides:



Advertisements
Similar presentations
Annotation of Gene Function …and how thats useful to you.
Advertisements

Applications of GO. Goals of Gene Ontology Project.
CELL COMMUNICATION. YOU MUST KNOW… THE 3 STAGES OF CELL COMMUNICATION: RECEPTION, TRANSDUCTION, AND RESPONSE HOW G-PROTEIN-COUPLED RECEPTORS RECEIVE CELL.
Molecular Basis for Relationship between Genotype and Phenotype DNA RNA protein genotype function organism phenotype DNA sequence amino acid sequence transcription.
Annotating Gene Products to the GO Harold J Drabkin Senior Scientific Curator The Jackson Laboratory Mouse.
Chapter 11 Cell Communication. Cell Signaling Evolved early in the History of Life.
Cell To Cell Communication
Cell signaling: responding to the outside world Cells interact with their environment by interpreting extracellular signals via proteins that span their.
Chapter 5 Ligand gated ion channels, intracellular receptors and phosphorylation cascades.
CELL CONNECTIONS & COMMUNICATION AP Biology Ch.6.7; Ch. 11.
11.2 Reception: A signaling molecule binds to a receptor protein, causing it to change shape A receptor protein on or in the target cell allows the cell.
Gene function analysis Stem Cell Network Microarray Course, Unit 5 May 2007.
Biology 107 Cellular Communication October 7, 2002.
COG and GO tutorial.
More regulating gene expression. Fig 16.1 Gene Expression is controlled at all of these steps: DNA packaging Transcription RNA processing and transport.
Molecular Physiology: Enzymes and Cell Signaling.
Chapter 6 Protein Phosphorylation. Objectives Know the general enzymes involved in phosphorylation Know the general enzymes involved in dephosphorylation.
PAT project Advanced bioinformatics tools for analyzing the Arabidopsis genome Proteins of Arabidopsis thaliana (PAT) & Gene Ontology (GO) Hongyu Zhang,
Books Molecular Cell Biology Lodish
Using The Gene Ontology: Gene Product Annotation.
Essential Bioinformatics and Biocomputing Module (Tutorial) Biological Databases Lecturer: Chen Yuzong Jan 2003 TAs: Cao Zhiwei Lee Teckkwong, Bernett.
Annotating Gene Products to the GO Harold J Drabkin Senior Scientific Curator The Jackson Laboratory Mouse.
SPH 247 Statistical Analysis of Laboratory Data 1May 14, 2013SPH 247 Statistical Analysis of Laboratory Data.
Chapter 6-10 AP Biology. Define phagocytosis and pinocytosis. What does it mean for a cell to have a concentration gradient?
Chapter 11: Cell Communication. Essential Knowledge 2.e.2 – Timing and coordination of physiological events are regulated by multiple mechanisms (11.1).
COURSE OF BIOINFORMATICS Exam_31/01/2014 A.
Cell Communication Chapter 9. Please note that due to differing operating systems, some animations will not appear until the presentation is viewed in.
Monday, November 8, 2:30:07 PM  Ontology is the philosophical study of the nature of being, existence or reality as such, as well as the basic categories.
From Functional Genomics to Physiological Model: Using the Gene Ontology Fiona McCarthy, Shane Burgess, Susan Bridges The AgBase Databases, Institute of.
Manual GO annotation Evidence: Source AnnotationsProteins IEA:Total Manual: Total
Introduction to the GO: a user’s guide Iowa State Workshop 11 June 2009.
SRI International Bioinformatics 1 Submitting pathway to MetaCyc Ron Caspi.
Copyright © 2005 Pearson Education, Inc. publishing as Benjamin Cummings PowerPoint Lectures for Biology, Seventh Edition Neil Campbell and Jane Reece.
24th Feb 2006 Jane Lomax GO Further. 24th Feb 2006 Jane Lomax GO annotations Where do the links between genes and GO terms come from?
Cell Signaling basics.
Gene Product Annotation using the GO ml Harold J Drabkin Senior Scientific Curator The Jackson Laboratory.
Alastair Kerr, Ph.D. WTCCB Bioinformatics Core An introduction to DNA and Protein Sequence Databases.
Getting Started: a user’s guide to the GO GO Workshop 3-6 August 2010.
Functional Annotation and Functional Enrichment. Annotation Structural Annotation – defining the boundaries of features of interest (coding regions, regulatory.
1 Gene function annotation. 2 Outline  Functional annotation  Controlled vocabularies  Functional annotation at TAIR  Resources and tools at TAIR.
Getting Started: a user’s guide to the GO TAMU GO Workshop 17 May 2010.
Rice Proteins Data acquisition Curation Resources Development and integration of controlled vocabulary Gene Ontology Trait Ontology Plant Ontology
Cell Communication Chapter Cell Communication: An Overview  Cells communicate with one another through Direct channels of communication Specific.
Protein Kinases Primary elements in signal transduction
Chapter 11 Cell Communication. Concept Check Questions Chapter 11 Cell Communication.
Introduction to the GO: a user’s guide NCSU GO Workshop 29 October 2009.
 Signaling molecules that function within an organism to control metabolic processes within cells, the growth and differentiation of tissues, the synthesis.
The Three Stages of Cell Signaling: A Preview
Cell Communication.
11.3 Transduction: Cascades of molecular interactions relay signals from receptors to target molecules in the cell  Signal transduction usually involves.
GO-Slim term Cluster frequency cytoplasm 1944 out of 2727 genes, 71.3% 70 out of 97 genes, 72.2% out of 72 genes, 86.1% out.
1 Annotation EPP 245/298 Statistical Analysis of Laboratory Data.
Negative regulation of cell cycle by intracellular signals Checkpoint p53 detects DNA damage & activates p21 p21 inhibits cdk2-cyclinA Intracellular Regulation.
Protein. Protein and Roles 1: biological process unknown 1.1 Structural categories 1.2 organism categories 1.3 cellular component o unlocalized.
2/3/2005 Gene Ontology (GO) The Gene Ontology (GO) project is a collaborative effort to address the need for consistent descriptions.
COURSE OF BIOINFORMATICS Exam_30/01/2014 A.
CELL SIGNALING: Cell signaling is part of a complex system of communication that governs basic cellular activities and coordinates cell actions. The ability.
Gene Annotation & Gene Ontology May 24, Gene lists from RNAseq analysis What do you do with a list of 100s of genes that contain only the following.
Gene Annotation & Gene Ontology
Annotating with GO: an overview
Overview: Cellular Messaging
Introduction to the Gene Ontology
Integrin signalling Vytášek 2010.
Overview of Cellular Signaling Mechanisms
Expression of Human Genes
Integrin signalling Vytášek 2009.
Gene expression analysis
Annotating Gene Products to the GO
Source Page Understanding for Heterogeneous Molecular Biological Data
Cell Signaling by Receptor Tyrosine Kinases
Presentation transcript:

SPH 247 Statistical Analysis of Laboratory Data 1 May 12, 2015 SPH 247 Statistical Analysis of Laboratory Data

Annotation Given that one has found one of more genes that are differentially expressed, there are a number useful things to know What is the putative function? What pathways are know to contain this gene? What other proteins interact with the given protein? etc. May 12, 2015BST 226 Statistical Methods for Bioinformatics2

Two-color array example May 12, 2015BST 226 Statistical Methods for Bioinformatics3 > alldata[1,] [1] [16] > geneID[1,] Name ID 1 NM_ discoidin domain receptor family, member

May 12, 2015BST 226 Statistical Methods for Bioinformatics4 Official SymbolDDR2 provided by HGNCHGNC Official Full Name discoidin domain receptor tyrosine kinase 2 provided by HGNCHGNC Primary sourceHGNC:2731HGNC:2731 Locus tagRP11-572K18.1 See relatedEnsembl:ENSG ; HPRD:01868; MIM:191311; Vega:OTTHUMG Ensembl:ENSG ;HPRD:01868;MIM:191311;Vega:OTTHUMG Gene typeprotein coding RefSeq statusREVIEWED OrganismHomo sapiensHomo sapiens LineageEukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo Also known asTKT; MIG20a; NTRKR3; TYRO10 SummaryReceptor tyrosine kinases (RTKs) play a key role in the communication of cells with their microenvironment. These molecules are involved in the regulation of cell growth, differentiation, and metabolism. In several cases the biochemical mechanism by which RTKs transduce signals across the membrane has been shown to be ligand induced receptor oligomerization and subsequent intracellular phosphorylation. This autophosphorylation leads to phosphorylation of cytosolic targets as well as association with other molecules, which are involved in pleiotropic effects of signal transduction. RTKs have a tripartite structure with extracellular, transmembrane, and cytoplasmic regions. This gene encodes a member of a novel subclass of RTKs and contains a distinct extracellular region encompassing a factor VIII-like domain. Alternative splicing in the 5' UTR results in multiple transcript variants encoding the same protein. [provided by RefSeq, Jul 2008]

Affy Example May 12, 2015BST 226 Statistical Methods for Bioinformatics5 > source(" > biocLite("annaffy") > biocLite("hgu95av2.db") > library(annaffy) > library(affy) Loading required package: Biobase Loading required package: tools … Loading required package: GO Loading required package: KEGG

May 12, 2015BST 226 Statistical Methods for Bioinformatics6 > probeids <- featureNames(eset)[allp2.adj <.05] > probeids[1:5] [1] "1005_at" "1009_at" "1034_at" "1035_g_at" "1045_s_at" > symbols <- aafSymbol(probeids,"hgu95av2.db") Loading required package: hgu95av2 > symbols[1] An object of class "aafList" [[1]] An object of class “aafSymbol” [1] "DUSP1" > getText(symbols[1]) [1] "DUSP1" > descs <- aafDescription(probeids,"hgu95av2.db")[1] > getText(descs)[1] [1] "dual specificity phosphatase 1" > gos <- aafGO(probeids,"hgu95av2.db")

May 12, 2015BST 226 Statistical Methods for Bioinformatics7 > gos[1] An object of class "aafList" [[1]] An object of class "aafGO" [[1]][[1]] An object of class "protein amino acid "Biological "IEA" [[1]][[2]] An object of class "response to oxidative "Biological "TAS" [[1]][[3]] An object of class "cell "Biological "IEA”

May 12, 2015BST 226 Statistical Methods for Bioinformatics8 [[1]][[4]] An object of class "non-membrane spanning protein tyrosine phosphatase "Molecular "TAS" [[1]][[5]] An object of class "protein "Molecular "IPI" [[1]][[6]] An object of class "hydrolase "Molecular "IEA"

May 12, 2015BST 226 Statistical Methods for Bioinformatics9 [[1]][[7]] An object of class "MAP kinase tyrosine/serine/threonine phosphatase "Molecular "IEA" There are actually 33 terms

GO Evidence Codes IEA = inferred from electronic annotation (e.g., BLAST). Uncurated TAS = traceable author statement (i.e., someone said so). May 12, 2015BST 226 Statistical Methods for Bioinformatics10

IDA = inferred from direct assay IEP = inferred from expression pattern IGI = inferred from genetic interaction IMP = inferred from mutant phenotype IPI = inferred from physical interaction ISS = inferred from sequence similarity NAS = non-traceable author statement ND = no biological data available NR = not recorded May 12, 2015BST 226 Statistical Methods for Bioinformatics11

Online Access > gbs <- aafGenBank(probeids,"hgu95av2.db") > getURL(gbs[[1]]) [1] " fcgi?cmd=search&db=nucleotide&term=X68277% 5BACCN%5D&doptcmdl=GenBank" > lls <- aafLocusLink(probeids,"hgu95av2.db") > getURL(lls[[1]]) [1] " Db=gene&Cmd=DetailsSearch&Term=1843" May 12, 2015BST 226 Statistical Methods for Bioinformatics12

Abstracts > pmids <- aafPubMed(probeids,"hgu95av2.db") > pmids[[1]] An object of class "aafPubMed" [1] [13] > pmids[1] An object of class “aafPubMed” [1] [9] [17] [25] [33] [41] > browseURL(getURL(lls[[1]])) May 12, 2015BST 226 Statistical Methods for Bioinformatics13

Abstracts > pmids <- aafPubMed(probeids,"hgu95av2.db") > pmids[[1]] An object of class "aafPubMed" [1] [9] [17] [25] [33] [41] [49] [57] [65] [73] [81] [89] [97] [105] [113] [121] [129] [137] [145] [153] May 12, 2015BST 226 Statistical Methods for Bioinformatics14

Direct Browsing > browseURL(getURL(lls[[1]])) > browseURL(getURL(gbs[[1]])) > browseURL(getURL(pmids[1])) May 12, 2015BST 226 Statistical Methods for Bioinformatics15

Top Genes > probeids.ord <- featureNames(eset)[order(allp2.adj)] > getText(aafSymbol(probeids.ord[1:10],"hgu95av2.db")) [1] "S100A2" "" "RPLP1" "GM2A" "" "RPS17" "GAPDH" "COPA" [9] "PSPHP1" "" > getText(aafDescription(probeids.ord[1:10],"hgu95av2.db")) [1] "S100 calcium binding protein A2" [2] "" [3] "ribosomal protein, large, P1" [4] "GM2 ganglioside activator" [5] "" [6] "ribosomal protein S17" [7] "glyceraldehyde-3-phosphate dehydrogenase" [8] "coatomer protein complex, subunit alpha" [9] "phosphoserine phosphatase pseudogene 1" [10] "" May 12, 2015BST 226 Statistical Methods for Bioinformatics16

> aafGO(probeids.ord[7],"hgu95av2.db") An object of class "microtubule cytoskeleton "Biological "ISS" [[1]][[2]] An object of class "glyceraldehyde-3-phosphate dehydrogenase (NAD+) (phosphorylating) "Molecular "ISS" [[1]][[3]] An object of class "glyceraldehyde-3-phosphate dehydrogenase (NAD+) (phosphorylating) "Molecular "NAS" [[1]][[4]] An object of class "protein "Molecular "IPI" May 12, 2015BST 226 Statistical Methods for Bioinformatics17 There are actually 41 terms