
The European Molecular Biology Laboratory (EMBL) is supported by sixteen countries. It consists of the main Laboratory in Heidelberg (Germany), Outstations in Hamburg (Germany), Grenoble (France) and Hinxton (U.K.), and an external Research Programme in Monterotondo (Italy). from 1996

The EBI Mission
• To provide Bioinformatics Facilities for the Scientific Community
• To become a flagship laboratory for research in bioinformatics
• To provide bioinformatics training
• To help disseminate standards & technologies

Role of Bioinformatics
• To Support Experimental Biology
• To Collect and Archive Data
• To provide Framework and Integration
• To give Easy Access to Data
• To make New Discoveries through Data Analysis
• To predict through modelling
• To facilitate application and exploitation of academic research in Medicine, Agriculture, Health and Environment

Dramatic Changes in Biology over the last 5 years
• Data Explosion & New Types of Data
• Move towards High-Throughput Biology
• Move towards Systems Biology
• Much larger community, often naïve users
• Growth of Applied Biology: molecular medicine, agriculture, food, environmental sciences

Bioinformatics (diagram labels): Genomes; Hypotheses and in silico models; Expression profiling; Comparative genomics; Mutant/RNAi data; Metabolic data; Literature; Proteome data; Biochemistry

Molecules to Cells to Organisms
Figure labels: E. coli; Genome; Protein; Genomes

Systems Biology
Figure labels: Output; Input; CheZ; CheW; CheB; ATP; ADP; Pi; Methyl; CheR; Methyl; Adaptor; Flim C; Pi; CheY; CheA

Molecular Basis of Disease
• p53 tumour suppressor core domain: cancers of many types
• Cu-Zn Superoxide Dismutase: autosomal dominant amyotrophic lateral sclerosis

From Structure to Functional Annotation

• PQS biological assemblies
• MSDchem ligand data
• Electron Density Visualisation, AstexViewer
• MSDPro, MSDlite
• SSM fold matching
• Surface Matching
• MSDsite Active sites
• Linking to Domain data, eFamily
• Sequence Mapping, SIFTS

From Structure To Biochemical Function
Gene → Protein → 3D Structure → Function
Given a protein structure:
• Where is the functional site?
• What is the multimeric state of the protein?
• Which ligands bind to the protein?
• What is the biochemical function?

High throughput
• A new sequence every 4 seconds
• web requests a day
• users
• 5-10 core databases
• cross-references
• About 160 other databases
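As a rough worked example, the "new sequence every 4 seconds" figure above can be converted into daily and yearly totals. This is a back-of-the-envelope sketch that assumes the rate is constant around the clock and over a 365-day year, which the slide does not state; the function name is illustrative only.

```python
SECONDS_PER_DAY = 24 * 60 * 60  # 86,400 seconds


def sequences_per_day(seconds_per_sequence: float) -> float:
    """Number of sequences arriving in one day at a constant rate (assumption)."""
    return SECONDS_PER_DAY / seconds_per_sequence


per_day = sequences_per_day(4)   # rate quoted on the slide
per_year = per_day * 365         # assumes a 365-day year at the same rate

print(f"{per_day:,.0f} sequences per day")
print(f"{per_year:,.0f} sequences per year")
```

Under these assumptions the quoted rate corresponds to 21,600 sequences per day, or roughly 7.9 million per year.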

Data Growth

Web requests per day (excluding Ensembl)

FTP by year: million files; terabytes

Web servers: requests (millions)

Distinct hosts served: number of users (millions)

Dynamic pages: domains (2005)

Rank  Domain                               Share
1     .uk (United Kingdom)                 21.14%
2     .com (Commercial)                    17.16%
3     [unknown domain]                     13.37%
4     [unresolved numerical addresses]     11.05%
5     .edu (USA Higher Education)           5.29%
6     .net (Networks)                       5.27%
7     .fr (France)                          4.76%
8     .it (Italy)                           4.68%
9     .de (Germany)                         2.81%
10    .nl (Netherlands)                     2.00%
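To put the 2005 domain breakdown above in perspective, the ten listed shares can be summed to see how much traffic they cover. The sketch below copies the percentages from the slide; grouping .uk, .fr, .it, .de and .nl as "European country domains" is an assumption made here for illustration, not a category the slide defines.

```python
# Shares (percent) copied from the 2005 domain table on the slide.
DOMAIN_SHARES = {
    ".uk": 21.14,
    ".com": 17.16,
    "[unknown domain]": 13.37,
    "[unresolved numerical addresses]": 11.05,
    ".edu": 5.29,
    ".net": 5.27,
    ".fr": 4.76,
    ".it": 4.68,
    ".de": 2.81,
    ".nl": 2.00,
}

# Assumed grouping for illustration; not taken from the slide.
EUROPEAN_COUNTRY_DOMAINS = {".uk", ".fr", ".it", ".de", ".nl"}

top10_total = round(sum(DOMAIN_SHARES.values()), 2)
european_total = round(
    sum(share for domain, share in DOMAIN_SHARES.items()
        if domain in EUROPEAN_COUNTRY_DOMAINS),
    2,
)
remainder = round(100 - top10_total, 2)  # share of all domains not listed

print(f"Top 10 domains: {top10_total}%")
print(f"Listed European country domains: {european_total}%")
print(f"All other domains: {remainder}%")
```

The ten listed entries cover 87.53% of requests, and the five listed European country domains together account for 35.39%.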

The Services of the EBI
• Nucleotide sequences
• Genes
• Transcription information
• Protein sequences
• Protein families
• Macromolecular structures
• Molecular interactions
• Pathways
• Metabolic information
• Scientific Literature

Structure of EBI: Services

Chart labels: Apweiler, Stoesser; Brazma; Birney; Henrick; Database Integration and External Services; Lopez; Stoehr, Zhu

Structure of EBI: Research

Text Mining; Computational Genomics; Structural Proteomics; Neuroinformatics; Phylogeny & Evolution

EBI Databases

Database               Content
GKB                    Pathways
EnsEMBL                Human Genome Gene Annotation
EMBL-Bank              DNA sequences
SWISS-PROT + TrEMBL    Protein Sequences
Array-Express          Microarray Expression Data
EMSD                   Macromolecular Structure Data
IntAct                 Protein Interactions

Integration

Integrative science demands integrative resources
• EBI databases have a backbone of integrative links
• cross-references support trans-database navigation
• Is this good enough?
• sparse and coarse-grain
• not straightforward to use
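The idea of cross-references supporting navigation between databases can be sketched as a graph search. The example below is illustrative only: the database names come from the slides, but every record identifier and link is hypothetical, and the breadth-first search is one simple way to follow such links, not a description of how the EBI services are implemented.

```python
from collections import deque

# Hypothetical cross-references: each (database, record) points to linked records.
# All record identifiers here are invented for illustration.
XREFS = {
    ("EMBL-Bank", "SEQ1"): [("SWISS-PROT", "PROT1")],
    ("SWISS-PROT", "PROT1"): [("EMSD", "STRUCT1"), ("IntAct", "INT1")],
    ("EMSD", "STRUCT1"): [],
    ("IntAct", "INT1"): [("SWISS-PROT", "PROT2")],
    ("SWISS-PROT", "PROT2"): [],
}


def find_path(start, target_db):
    """Breadth-first search for the shortest chain of cross-references
    from `start` to any record in `target_db`; returns None if unreachable."""
    queue = deque([[start]])
    seen = {start}
    while queue:
        path = queue.popleft()
        database, _record = path[-1]
        if database == target_db:
            return path
        for linked in XREFS.get(path[-1], []):
            if linked not in seen:
                seen.add(linked)
                queue.append(path + [linked])
    return None


path = find_path(("EMBL-Bank", "SEQ1"), "EMSD")
print(" -> ".join(f"{db}:{rec}" for db, rec in path))
```

In this toy graph a nucleotide record reaches a structure record only via a protein record, which illustrates why sparse, coarse-grained links can make navigation indirect.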

Integrative science demands integrative resources
Major efforts involved in integration
• InterPro: database of protein families, domains and functional sites.
• Integr8: data integration project co-ordinated by the EBI, to provide an integrated layer for the exploitation of genomic and proteomic data.
• GRID technologies

European Patent Office
• Support the inclusion of sequence data in the public databases
• Development of tools to capture sequence data
• Run their searches at the EBI
• (similar arrangements in USA and Japan ensure exchange)
• Analogous systems being developed for structure information

Industry Support

• Current successful Industry programme for Pharma
• Quarterly meetings
• R&D Training: workshops
• Industry Forum
• Funded by subscriptions
• New SME programme under development

New Data
• Expression Data
• Proteomic Data
• Metabolome Data
• Chip-on-Chip
• Atlases
• Electron tomographs
• Human Variation
• Disease Links
• ??

The Magic Search Box