Interactions and Ontologies


Interactions and Ontologies
CBW Bioinformatics Workshop, February 24th 2004, Vancouver
Christopher Hogue, Blueprint Initiative
Lab 10.4 (c) 2003 CGDN

In the lab
Predicting Interactions with STRING, PreBIND, BIND BLAST
BIND Stats, Divisions, MMDBBIND
BIND Searching, Filtering, Reporting, Exporting
Exporting Data to Excel & Cytoscape
OntoGlyphs & BIND Interaction Viewer
BIND Index
SeqHound & SeqHound API

Predicting Interactions
How to make reasonable-quality integrative interaction predictions?
STRING
Pre-BIND
BIND-BLAST
Beware of GIGO (garbage in, garbage out).

PreBIND
Organism Constrained Vocabulary
Hybrid Text-Mining Approach
Score abstracts that have pairs of gene/protein names
Score these for matches to patterns of words found in “interaction abstracts”
Score them again for “interaction sentences”
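
The three scoring passes on this slide can be sketched roughly as follows. This is a minimal illustration and not PreBIND's actual classifier: the name list, the cue-word list, and the simple counting scores are all assumptions made up for the example.

```python
import re
from itertools import combinations

# Hypothetical inputs: a tiny name vocabulary and cue-word list, for illustration only.
PROTEIN_NAMES = {"Rad51", "Rad52", "Cdc28", "Cln2"}
INTERACTION_CUES = {"interacts", "binds", "associates", "complex"}


def split_sentences(text):
    """Naive sentence splitter: break after ., ! or ? followed by whitespace."""
    return [s for s in re.split(r"(?<=[.!?])\s+", text.strip()) if s]


def names_in(text):
    """Return the known protein names that occur as whole tokens in text."""
    tokens = set(re.findall(r"[A-Za-z0-9]+", text))
    return PROTEIN_NAMES & tokens


def cues_in(text):
    """Return the interaction cue words that occur in text (case-insensitive)."""
    return INTERACTION_CUES & set(re.findall(r"[a-z]+", text.lower()))


def score_abstract(abstract):
    """Score one abstract in three passes, mirroring the three steps on the slide."""
    # Pass 1: count pairs of known gene/protein names mentioned in the abstract.
    pair_score = len(list(combinations(sorted(names_in(abstract)), 2)))
    # Pass 2: count interaction cue words anywhere in the abstract.
    cue_score = len(cues_in(abstract))
    # Pass 3: count sentences that contain both a name pair and a cue word.
    sentence_score = sum(
        1
        for sentence in split_sentences(abstract)
        if len(names_in(sentence)) >= 2 and cues_in(sentence)
    )
    return {"pairs": pair_score, "cues": cue_score, "sentences": sentence_score}


example = "Rad51 binds Rad52 in vivo. Cdc28 was also studied."
print(score_abstract(example))
```

A real system would replace the fixed word lists with an organism-specific vocabulary and a trained model; the sketch only shows how the abstract-level and sentence-level passes relate.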

BIND-BLAST

Co-localization filtering!

BIND Divisions and Searching

From BIND Stats to Sets

41 Identifier Searches Supported

BIND Divisions
Blueprint Databases: BIND Taxroot, BIND Fungi, BIND Metazoa, RefBIND Taxroot, BIND 3DBP, BIND 3DSM (SMID)
Partner Databases: HIV-HPID, MIPS, FlyBase, MGI, SGD
IMEX Partners: MINT, IntAct, DIP (?)
Pathway Partners

Division Controls

MMDBBIND
Displays interactions in Cn3D, and generates experimental annotation from PDB comments

MMDBBIND is a large fraction of BIND – most of the records are oligomers.

Ribosomal protein contacts

A ribosome interaction record
Ribosomal protein L5AB

Individual contacts between L5AB and rRNA are highlighted in yellow

BIND Searching, Filtering, Reporting, Exporting
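
For the Cytoscape export named in the lab outline, one common target is Cytoscape's Simple Interaction Format (SIF), where each line holds a source node, a relationship type, and a target node. The sketch below writes such a file from a list of pairs. Whether BIND's own exporter produced SIF is an assumption here, and the molecule names are placeholders.

```python
import io

# Hypothetical interaction pairs; the names are placeholders, not BIND records.
INTERACTIONS = [("protA", "protB"), ("protA", "protC")]


def write_sif(interactions, handle, relation="pp"):
    """Write one tab-separated SIF line per interaction: source, relation, target."""
    for source, target in interactions:
        handle.write(f"{source}\t{relation}\t{target}\n")


buffer = io.StringIO()
write_sif(INTERACTIONS, buffer)
print(buffer.getvalue(), end="")
```

Passing an open file instead of the in-memory buffer would produce a file that Cytoscape can import as a network; the same rows also open directly in Excel because they are tab-delimited.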

OntoGlyphs
A graphical language
Derived from Gene Ontology annotation
The most-used terms/categories
A means of compressing molecular function concepts into tiny spaces…

Annotation Links
Links to Domain DBs
Links to AmiGO

All GO sources are listed with Evidence Codes
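
Because every GO annotation carries an evidence code, annotations can be grouped or filtered by how they were derived. The sketch below is a minimal example with made-up annotation records; the gene product names are placeholders and the record layout is an assumption, not BIND's format. The codes themselves (IDA, IPI, IEA, ISS and so on) are standard GO evidence codes.

```python
# Hypothetical annotation records: (gene product, GO term, evidence code).
ANNOTATIONS = [
    ("geneA", "GO:0005739", "IDA"),  # inferred from direct assay
    ("geneA", "GO:0005634", "IEA"),  # inferred from electronic annotation
    ("geneB", "GO:0003677", "IPI"),  # inferred from physical interaction
    ("geneB", "GO:0006281", "ISS"),  # inferred from sequence similarity
]

# GO evidence codes that denote experimental support.
EXPERIMENTAL_CODES = {"EXP", "IDA", "IPI", "IMP", "IGI", "IEP"}


def experimental_only(annotations):
    """Keep only annotations backed by an experimental evidence code."""
    return [a for a in annotations if a[2] in EXPERIMENTAL_CODES]


def group_by_evidence(annotations):
    """Map each evidence code to the (gene product, GO term) pairs that use it."""
    groups = {}
    for product, term, code in annotations:
        groups.setdefault(code, []).append((product, term))
    return groups


print(experimental_only(ANNOTATIONS))
print(sorted(group_by_evidence(ANNOTATIONS)))
```

Filtering out electronically inferred (IEA) annotations is a common first step when only curated or experimental evidence should drive an analysis.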

HIV Integrase Interaction Network
Red – DNA binding
Blue – Protein Transport

Figure labels: DNA Binding/Transcription, Proteasome, Chaperones, Nuclear Transport

BIND Viewer Tool – atp14
Many hits from yeast-two-hybrid data

Too many… Which molecules are co-localized with atp14?

Select

Invert

Hide Selected

Voila! Only co-localized proteins!
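
The Select, Invert, Hide Selected sequence amounts to a few set operations over the visible nodes. The sketch below is an assumption-laden illustration: the partner names and compartment annotations are invented, and it does not reproduce the BIND Interaction Viewer's internals.

```python
# Hypothetical data: interaction partners of the query and each molecule's
# cellular-component annotation (partner names are placeholders).
QUERY = "atp14"
PARTNERS = ["protA", "protB", "protC", "protD"]
LOCATION = {
    "atp14": {"mitochondrion"},
    "protA": {"mitochondrion"},
    "protB": {"nucleus"},
    "protC": {"mitochondrion", "cytoplasm"},
    "protD": set(),  # no localization annotation
}


def colocalized_partners(query, partners, location):
    """Return the partners that share at least one compartment with the query."""
    visible = set(partners)
    # Select: partners sharing a compartment with the query.
    selected = {p for p in visible if location.get(p, set()) & location[query]}
    # Invert: the selection becomes everything that is NOT co-localized.
    selected = visible - selected
    # Hide Selected: remove the inverted selection from view.
    visible -= selected
    return sorted(visible)


print(colocalized_partners(QUERY, PARTNERS, LOCATION))
```

Note that molecules with no localization annotation are hidden along with those in other compartments, which is worth keeping in mind when interpreting the filtered network.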

OntoGlyphs: complete details on blueprint.org

Blueprint’s FTP Site and the BIND Index

FTP Site Browser

The BIND Index contains selected fields for BIND records in plain-text, tab-delimited format: ftp://ftp.blueprint.org/pub/BIND/current/bindflatfiles/bindindex/ These files may be used to locate BIND records that describe a given biomolecule. BIND records contain many more fields than those listed in these indices. Complete BIND records are available in XML or ASN.1 format from ftp://ftp.bind.ca/pub/BIND/data.
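
Locating records for a given biomolecule in a tab-delimited index can be sketched as below. The real BIND Index column layout is not given in these slides, so the three columns used here (record id, molecule A, molecule B) and the sample rows are assumptions for illustration only.

```python
import csv
import io

# Hypothetical layout: record id, molecule A, molecule B (tab-delimited).
# The sample rows are invented and do not come from the BIND Index.
SAMPLE_INDEX = "1001\tprotA\tprotB\n1002\tprotC\tprotA\n1003\tprotB\tprotD\n"


def find_records(index_file, molecule):
    """Return the ids of index rows that mention the given molecule."""
    hits = []
    for row in csv.reader(index_file, delimiter="\t"):
        if len(row) < 3:
            continue  # skip blank or malformed lines
        record_id, mol_a, mol_b = row[0], row[1], row[2]
        if molecule in (mol_a, mol_b):
            hits.append(record_id)
    return hits


print(find_records(io.StringIO(SAMPLE_INDEX), "protA"))
```

With a downloaded index file, the `io.StringIO` object would be replaced by an open file handle, and the column positions adjusted to the documented layout.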

SeqHound

What is SeqHound?
Data sources: GenBank, MMDB, MedLine, BIND, Taxonomy, GO, LocusLink, CDD
Components: SeqHound Module, Database Interface, Application Programming Interface, Web Interface, Remote Interface
Users: Local Programmer, Web user, Remote Programmer

SeqHound is a bioinformatics application programming platform. A remote application programming interface (API) in C, C++, Perl or Java is available. This API gives you access to a database of biological sequences and structures. SeqHound also stores additional information related to each of these sequences. This includes links to Gene Ontology descriptions, Medline abstracts, taxon descriptions, associated structures, redundant sequences, sequence neighbours, conserved domains, database cross-references, Online Mendelian Inheritance in Man identifiers, LocusLink identifiers and complete genomes.

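A sample SeqHound program using the Perl API

    #!/usr/bin/perl -w
    use strict;
    use SeqHound;

    print "Starting Program\n";

    # Initialize SeqHound and print the return value.
    print "SHoundInit ";
    my $aa = SHoundInit("TRUE", "myapp");
    print "$aa\n";

    # Check whether initialization succeeded.
    print "SHoundIsInited ";
    $aa = SHoundIsInited();
    print "$aa\n";

    # Look up a sequence by its accession.
    print "SHoundFindAcc ";
    $aa = SHoundFindAcc("CAA28783");
    print "$aa\n";

    # Shut SeqHound down.
    print "SHoundFini ";
    $aa = SHoundFini();
    print "$aa\n";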
