For EGI/EUDAT EMBL/ELIXIR use-cases Tony Wildish


1 For EGI/EUDAT EMBL/ELIXIR use-cases
Tony Wildish
www.ebi.ac.uk

2 What is EMBL-EBI?
- Europe’s home for biological data services, research and training
- A trusted data provider for the life sciences
- Part of the European Molecular Biology Laboratory, an intergovernmental research organisation
- International: 570 members of staff from 57 nations
- Home of the ELIXIR Technical hub

3 A distributed data infrastructure for Europe
- EMBL-EBI is a founding member of ELIXIR: Europe’s distributed research infrastructure for biological information
- Mission: to support life science research and its translation to medicine, the environment, the bioindustries and society
- ELIXIR Nodes represent centres of excellence throughout Europe

4 Data resources available from EMBL-EBI
- Genes, genomes & variation: European Nucleotide Archive, European Variation Archive, European Genome-phenome Archive, Ensembl, Ensembl Genomes, GWAS Catalog, Metagenomics portal
- Gene, protein & metabolite expression: RNAcentral, ArrayExpress, Expression Atlas, MetaboLights, PRIDE
- Protein sequences, families & motifs: InterPro, Pfam, UniProt
- Molecular structures: Protein Data Bank in Europe, Electron Microscopy Data Bank
- Chemical biology: ChEMBL, ChEBI
- Reactions, interactions & pathways: IntAct, Reactome, MetaboLights
- Systems: BioModels, Enzyme Portal, BioSamples
- Literature & ontologies: Europe PubMed Central, Gene Ontology, Experimental Factor Ontology

5 ELIXIR: Driven by 4 scientific use-cases
- Marine Metagenomics
- Genomic & Phenotypic data for Crop and Forest plants
- Rare Diseases
- Human Genetic Data
  - Unlikely to start with human data due to security constraints
All scientific use-cases require either private or public data sets to be replicated from the source or between analysis sites.

6 Use-case characteristics
- Data volumes from tens to several hundreds of GB per month
- Human data likely to account for the largest volume and traffic
- Replication between a handful of sites
- Smaller subsets downloaded for individual analyses
- End-users widely distributed
- Strictly controlled access to human data; other datasets often freely available
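To put the monthly volumes above in network terms, a minimal back-of-the-envelope sketch follows. The 300 GB figure, the four replica sites, the 30-day month and the function name are assumptions chosen for illustration, not numbers from the presentation.

```python
# Illustrative only: volume and site count are assumed, not ELIXIR figures.
SECONDS_PER_MONTH = 30 * 24 * 3600  # assumes a 30-day month


def sustained_mbit_per_s(gb_per_month: float, replica_sites: int = 1) -> float:
    """Average rate (Mbit/s) needed to copy gb_per_month (decimal GB)
    from one source to each of replica_sites sites within a month."""
    total_bits = gb_per_month * 1e9 * 8 * replica_sites
    return total_bits / SECONDS_PER_MONTH / 1e6


if __name__ == "__main__":
    rate = sustained_mbit_per_s(300, replica_sites=4)
    print(f"{rate:.2f} Mbit/s sustained")
```

Under these assumptions the average rate is only a few Mbit/s, which suggests that access control and coordination between sites, rather than raw bandwidth, are likely to be the harder problems at this scale.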

7 ELIXIR Technical Use-Cases
- From the scientific use-cases, abstract Technical Use-Cases (TUCs)
- 23 TUCs identified so far:
  - Infrastructure Service Directory, Cloud IaaS...
  - Federated ID, ELIXIR ID, Credential Translation…
  - Service Access Management, Resource Accounting...
  - VM library, Container library...
  - Network File Storage, File Transfer, Dataset Replication...
  - ...and others

8 Use of e-Infrastructures
- Federated Cloud & Cluster Infrastructures
  - Examining the reuse of EGI Cluster and Cloud Technologies
  - Benefit from investment in federated operations: GoCDB, APEL, AppDB, ARGO
- Data Infrastructures
  - Working with EUDAT to identify generic data capabilities
  - Initial focus on new service around data set replication
  - Opportunistic reuse of EUDAT services as development progresses

9 Role of EGI/EUDAT
- Scientific use-case exploration still in very early stages
- Plan to elaborate over coming weeks
- Map scientific use-cases to TUCs
- Check for orthogonality, completeness
- Group, map dependencies, prioritise
- Correlate with EGI/EUDAT projects and timelines
- See how much work we can avoid doing for ourselves
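The mapping, completeness and dependency steps listed above could be sketched roughly as follows. This is only an illustration: the TUC names come from the earlier slide, but which use-case needs which TUC, and which TUC depends on which, are hypothetical assignments made up for the example, as are the function names.

```python
from collections import Counter
from graphlib import TopologicalSorter  # Python 3.9+

# Hypothetical mapping: TUCs each scientific use-case would need.
USE_CASE_TUCS = {
    "Marine Metagenomics": {"File Transfer", "Dataset Replication", "Cloud IaaS"},
    "Rare Diseases": {"Federated ID", "Service Access Management",
                      "Dataset Replication"},
}

# Hypothetical dependencies: TUC -> TUCs it relies on.
TUC_DEPENDS_ON = {
    "File Transfer": set(),
    "Federated ID": set(),
    "Cloud IaaS": set(),
    "Dataset Replication": {"File Transfer"},
    "Service Access Management": {"Federated ID"},
    "VM library": {"Cloud IaaS"},
}


def needed_tucs(mapping):
    """All TUCs referenced by at least one use-case."""
    return set().union(*mapping.values())


def completeness(mapping, known):
    """Return (TUCs needed but not defined, TUCs defined but unused)."""
    needed = needed_tucs(mapping)
    return needed - set(known), set(known) - needed


def priorities(mapping):
    """Count how many use-cases need each TUC; higher counts first."""
    counts = Counter(t for tucs in mapping.values() for t in tucs)
    return counts.most_common()


def build_order(deps):
    """Order TUCs so that each appears after everything it depends on."""
    return list(TopologicalSorter(deps).static_order())


if __name__ == "__main__":
    print(completeness(USE_CASE_TUCS, TUC_DEPENDS_ON))
    print(priorities(USE_CASE_TUCS))
    print(build_order(TUC_DEPENDS_ON))
```

Keeping the mapping and the dependency graph as plain dictionaries makes it easy to re-run the checks each time a use-case or TUC is added during the exploration phase.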

10 Questions?

