Presentation is loading. Please wait.

Presentation is loading. Please wait.

EBI resources introductory course Pablo Porras Millán

Similar presentations


Presentation on theme: "EBI resources introductory course Pablo Porras Millán"— Presentation transcript:

1 EBI resources introductory course Pablo Porras Millán pporras@ebi.ac.uk www.ebi.ac.uk

2 Schedule 8:30 - 9:30Intro to EBI 9:30 - 10:00Expectations assessment 10:00 - 11:30 Browsing the genome and exploring sequences: DNA & RNA services Ensembl, Ensembl Genomes, ENA. 11:30 - 12:00Break 12:00 - 12:30Studying expression profiles: Gene expression services Array Express and Expression Atlas 12:30 - 13:30 Understanding proteins: Resources for identification and annotation GO, UniProt & InterPro 13:30 - 14:30Lunch 14:30 - 15:30 Proteomics and systems: From mass spectrometry data to models PRIDE, IntAct, Reactome & BioModels 15:30 - 16:00Break 16:00 - 16:30Small molecules bioinformaticsChEMBL, ChEBI, Metabolights 16:30 - 17:00Expectations re-assessments, Q&A

3 The hub for bioinformatics in Europe The EMBL-European Bioinformatics Institute

4 What is EMBL-EBI? Part of the European Molecular Biology Laboratory International, non-profit research institute Europe’s hub for biological data, services and research

5 The European Molecular Biology Laboratory Grenoble Structural biology Hinxton, Cambridge Bioinformatics Hamburg Structural biology Heidelberg Basic research Administration EMBO EMBL staff: 1500 people >60 nationalities EMBL staff: 1500 people >60 nationalities Monterotondo, Rome Mouse biology

6 EMBL-EBI’s mission Provide freely available data and bioinformatics services to all facets of the scientific community in ways that promote scientific progress Contribute to the advancement of biology through basic investigator-driven research in bioinformatics Provide advanced bioinformatics training to scientists at all levels, from PhD students to independent investigators Help disseminate cutting-edge technologies to industry Coordinate biological data provision throughout Europe

7 EMBL member states Austria, Belgium, Croatia, Denmark, Finland, France, Germany, Greece, Iceland, Ireland, Israel, Italy, Luxembourg, the Netherlands, Norway, Portugal, Spain, Sweden, Switzerland and the United Kingdom Associate member state: Australia

8 Data and tools for molecular life science Services www.ebi.ac.uk/services

9 What services do we provide? Labs around the world send us their data and we… Archive it Classify it Share it with other data providers Analyse it …provide tools to help researchers use it A virtuous circle

10 Bioinformatics underpins research Genomes Nucleotide sequence Gene expression Protein families, domains and motifs Protein structure Protein-protein interactions Chemical entities Pathways Systems Literature Protein sequence and proteomes

11 Standards – international collaborations Genome annotation www.geneontology.org Genome annotation www.geneontology.org Functional Genomics Data Society www.fged.org Protein sequence www.uniprot.org Protein sequence www.uniprot.org HUPO- Proteomics Standards Initiative (PSI) www.psidev.info/ HUPO- Proteomics Standards Initiative (PSI) www.psidev.info/ Protein structure www.wwpdb.org Protein structure www.wwpdb.org Cheminformatics www.ebi.ac.uk/chebi Cheminformatics www.ebi.ac.uk/chebi Pathways www.reactome.org www.biopax.org Pathways www.reactome.org www.biopax.org Systems modelling standards www.sbml.org Systems modelling standards www.sbml.org Metabolomics Standards Initiative (MSI) www.metabolomicssociety.org Metabolomics Standards Initiative (MSI) www.metabolomicssociety.org Genomics Standards Consortium (GSC) http://gensc.org Genomics Standards Consortium (GSC) http://gensc.org Nucleotide sequence www.insdc.org Nucleotide sequence www.insdc.org

12 EMBL-EBI users: a one-day snapshot

13 Key facts about our services Freely available A comprehensive collection of molecular databases Globally coordinated data collection and dissemination Produced in collaboration with other world leaders: NCBI (US) National Institute of Genetics (Japan) SIB Swiss Institute of Bioinformatics (Switzerland) Wellcome Trust Sanger Institute (UK)

14 Data resources DNA & RNA genes, genomes & variation Gene expression RNA, protein & metabolite expression Proteins sequences, families & motifs Structures molecular & cellular structures Systems reactions, interactions & Chemical biology chemogenomics & metabolomics Ontologies taxonomies & controlled vocabularies Literature Scientific publications & patents Other software cross-domain tools & resources pathways

15 The EBI Search Service Gene and protein summaries Data organised by: gene expression protein structure literature Data organised by: gene expression protein structure literature Species selector allows for easy comparison Explore the data and return easily to your results Explore the data and return easily to your results

16 Bioinformatics tools Over 100 analysis tools Results enriched with data from EBI resources Nucleotide sequence search e.g. BLAST nucleotide Protein sequence search e.g. BLAST protein, PSI-Search Multiple sequence alignment e.g. Clustal Omega, MUSCLE Pairwise sequence alignment e.g. Needle Protein functional analysis e.g. InterProScan Functional genomics tools e.g. Expression Atlas Molecular structure analysis e.g. PDBeFold Text mining e.g. EBIMed, Whatizit

17 Run tasks on EBI servers, using EBI data Ideal for large scale analyses, repetitive tasks and internal pipelines Integration of EBI resources and data EBI Search, tools, data retrieval Same programs, data and results enrichment as running via the web pages www.ebi.ac.uk/tools/webservices Programmatic access: EBI Web Services

18 Data-driven discovery PhD and postdoctoral programmes Research www.ebi.ac.uk/research

19 Research themes Genes & gene expression Paul Bertone Ewan Birney Alvis Brazma Anton Enright Paul Flicek Nick Goldman Genes & gene expression Paul Bertone Ewan Birney Alvis Brazma Anton Enright Paul Flicek Nick Goldman Proteins, structures & chemical biology Alex Bateman Gerard Kleywegt John Overington Christoph Steinbeck Sarah Teichmann Janet Thornton Proteins, structures & chemical biology Alex Bateman Gerard Kleywegt John Overington Christoph Steinbeck Sarah Teichmann Janet Thornton Systems biology Pedro Beltrao John Marioni Julio Saez-Rodriguez Oliver Stegle Systems biology Pedro Beltrao John Marioni Julio Saez-Rodriguez Oliver Stegle

20 Research leaders John Overington Janet Thornton Christoph Steinbeck Ewan Birney Paul Flicek Nick Goldman John Marioni Oliver Stegle Gerard Kleywegt Paul Bertone Alex Bateman Sarah Teichmann Alvis Brazma Anton Enright Pedro Beltrao Julio Saez- Rodriguez

21 Examples of EMBL-EBI research What is the molecular basis of ageing? How do the neurons of someone with Parkinson’s disease signal differently from healthy neurons? What makes a stem cell decide to become skin or muscle? Which of these proteins will make good targets for drugs? Which of these changes to a genome’s structure drive cancer?

22 PhDs and Postdocs EMBL International PhD programme: www.embl.de/training/eipp Postdoctoral positions available from: www.ebi.ac.uk/jobs Postdoctoral fellowships: EIPOD EMBL sponsored: interdisciplinary ESPOD EBI–Sanger: combined experimental/computational

23 User training www.ebi.ac.uk/training For scientists working at all levels

24 Bioinformatics training Train at EMBL-EBI Gain hands-on experience in our state-of- the-art facilities. Train online Learn in your own time, at your own pace with our freely available online courses. Train at your place Choose the training that’s right for you and your colleagues - and our experts will come to you. www.ebi.ac.uk/training

25 Train online Free online courses Learn in your own time, at your own pace Created for life-science researchers No previous knowledge of bioinformatics needed www.ebi.ac.uk/training/onlin e

26 Support and collaboration Interactions with industry www.ebi.ac.uk/industry

27 The EMBL-EBI Industry Programme Helping industry make the most of innovations in bioinformatics Neutral ground for members to explore developments and concepts Pre-competitive collaboration Standards development Technical development Input into services development “The Programme’s regular meetings foster inter-company interactions as we collaborate on special projects and liaise on other industry initiatives.” - Bertram

28 Industry Programme members Astellas Pharma Inc. AstraZeneca Bayer Pharma AG Boehringer Ingelheim Bristol-Meyers-Squibb Eli Lilly and Company F. Hoffmann-La Roche GlaxoSmithKline Johnson & Johnson Pharmaceutical R&D Merck Serono S.A. Nestlé Institute of Health Sciences Novartis Pharma AG Novo Nordisk Syngenta Sanofi-Aventis Recherche & Développement UCB Unilever

29 EMBL member states The European Commission The Wellcome Trust Research Councils UK US National Institutes of Health With thanks to our funders Supported by the European Community's Seventh Framework Programme (FP7/2007-2013) under grant agreement for Affinomics (FP7-241481).

30 A brief introduction to standards and data integration

31 Lazebnik, Biochemistry (Mosc). 2004, PMID: 15627398

32

33 Serendipitiously Recovered Component Most Important Component Really Important Component Undoubtedly Most Important Component

34 A model that reflects reality The biologist’s model

35 Standards Images from: http://archive.nrc-cnrc.gc.ca/eng/projects/inms/mass.html http://commons.wikimedia.org/wiki/File:Ce-logo.jpg http://www.nmpdr.org/FIG/wiki/view.cgi/FIG/FastaFormat

36 Standards in bioinformatics Common identifiers Controlled vocabularies / ontologies Common formats Common schemas Minimum information guidelines Common query interfaces Schema Data distribution Reporting guideline Control vocabulary Format Identifiers

37 DB I I I I I Database I User The problem of data integration Ideally Reality Interface

38 Utility of Bioinformatics Scientific impact Too little bioinformatics Too many databases Too diverse interfaces Tim Hubbard

39 Data integration DB I I I I Ideally Compromise DatabaseInterface I User Combining data residing in different sources … providing users with a unified view of these data. DB I I Reality SHARED CONTROLLED VOCABULARIES!!

40 From xkcd: http://xkcd.com/927/ The danger with standards…

41 Access, exchange, sharing, portability, interoperability, annotation, comparison, verification, representation, integration, reusability. Nucleotide sequences INSDC EMBL DDBJ NCBI Molecular interactions IMEx IntAct InnateDB DIP MINT … Collaboration among data providers More data coverage Less redundancy Less inconsistency Better data management Protein indentifications ProteomeXchange PRIDE PeptideAtlas GPMDB Tranche …

42 Work group of the Proteomics Standards Initiative Community coordination to ensure deposition of Molecular Interaction data in public repositories Concentrating on … Annotation and representation of published MI data Accessibility of MI data to the user community Example of community development of standards standards: PSI-MI Data format/schema Data distribution Control vocabulary MIAPE Reporting guideline PSI-MI XML PSI-MITAB PSICQUIC MIMIx IMEx PSI-MI CV http://www.psidev.info/MI Scoring PSISCORE

43 Thank you! www.ebi.ac.uk Twitter: @emblebi Facebook: EMBLEBI YouTube: EMBLMedia


Download ppt "EBI resources introductory course Pablo Porras Millán"

Similar presentations


Ads by Google