EMBL-EBI and Bioinformatics Steven Newhouse, Head of Technical Services, EMBL-EBI.

Slides:



Advertisements
Similar presentations
The library as a virtual research environment Bill Hubbard SHERPA Project Manager University of Nottingham.
Advertisements

From CESSDA to European Research Infrastructure Developments in cross-European data sharing.
The EMBL-European Bioinformatics Institute
Study Project The Countries and Capitals of the European Union.
Human Frontier Science Program The Human Frontier Science Program promotes international collaborations on the mechanisms underlying the complex functions.
Welcome to CERN Accelerating Science and Innovation 2 nd March 2015 – Bidders Conference – DO-29161/EN.
The Imperial College Tissue Bank A searchable catalogue for tissues, research projects and data outcomes Prof Gerry Thomas - Dept. Surgery & Cancer The.
Steven Newhouse, Head of Technical Services Virtualisation and Cloud Computing at EBI.
 Amazon Web Services announced the launch of Cluster Compute Instances for Amazon EC2.  Which aims to provide high-bandwidth, low- latency instances.
Welcome to CERN Research Technology Training Collaborating.
Welcome to EMBL-EBI Dr Laura Emery. Before we start… Stand up How experienced are you in bioinformatics? Get to know each other by arranging yourselves.
ISARE : Health indicators in the regions of Europe André Ochoa for Isare team ISARE : Health indicators in the regions of Europe André Ochoa for Isare.
A litigious incumbent and a cautious regulator and other reasons why R&E networking is expensive in some countries Marko Bonač ARNES, Slovenia
1 Establishing a Pharmacovigilance Centre Sten Olsson the Uppsala Monitoring Centre.
Steven Newhouse, Head of Technical Services European Bioinformatics Institute: ICT Challenges.
European Life Sciences Infrastructure for Biological Information ELIXIR
Learning and exploring Life science through the EBI reosurces and tools BIOQUEST workshop_2011 Vicky Schneider, EMBL-EBI Training Programme Project leader.
Network Services for Biologists in the Genome Era The Work of the European Bioinformatics Institute.
Development of Censuses in Europe and Development for EC Statistical Co-operation European Commission (Eurostat) Jurgen Heimann UNFPA/PARIS 21 International.
Slovak Fundraising Centre Observer of EFA. Introducing EFA Formally established in Brussels in 2002…  The European Fundraising Association is a network.
INTERNATIONALA CONFERENCE Security and Defence R&D Management: Policy, Concepts and Models R&D HUMAN CAPITAL POLICY ASSISTANT PROFESSOR KONSTANTIN POUDIN.
Rackspace Analyst Event Tim Bell
INFSO-RI Enabling Grids for E-sciencE V. Breton, 30/08/05, seminar at SERONO Grid added value to fight malaria Vincent Breton EGEE.
Notur: - Grant f.o.m is 16.5 Mkr (was 21.7 Mkr) - No guarantees that funding will increase in Same level of operations maintained.
THE EUROPEAN UNION. HISTORY 28 European states after the second world war in 1951 head office: Brussels 24 different languages Austria joined 1995.
The LHC Computing Grid – February 2008 The Worldwide LHC Computing Grid Dr Ian Bird LCG Project Leader 25 th April 2012.
Schools for Health in Europe SHE Goof Buijs NIGZ 8 June 2008 Vancouver, partnership track.
COST Workshop on Developing Knowledge- Sharing Partnerships in Europe and Central Asia Orsolya Tóth National Innovation Office Gödöllő, 4 December, 2013.
The LHC Computing Grid – February 2008 The Challenges of LHC Computing Dr Ian Bird LCG Project Leader 6 th October 2009 Telecom 2009 Youth Forum.
ESFRI & e-Infrastructure Collaborations, EGEE’09 Krzysztof Wrona September 21 st, 2009 European XFEL.
Map - Region 3 Europe.
© Enterprise Europe Network South West 2009 The Eurostars Programme Kenny Legg R&D Funding for the Environmental Sector – 29 June 2010 European Commission.
EGI-InSPIRE Steven Newhouse Interim EGI.eu Director EGI-InSPIRE Project Director Technical Director EGEE-III 1GDB - December 2009.
European Molecular Biology Laboratory: An overview
The (IMG) Systems for Comparative Analysis of Microbial Genomes & Metagenomes: N America: 1,180 Europe: 386 Asia: 235 Africa: 6 Oceania: 81 S America:
Stakeholder Relations at Large-Scale Infrastructures The CERN Model Rolf Heuer 7 th Canadian Science Policy Conference, Ottawa, 26 November 2015.
The United States of Europe
Northern Europe Label the following countries on the next page, using the color each countries is labeled in, then add capitals to each country using a.
The European Law Students’ Association Albania ˙ Austria ˙ Azerbaijan ˙ Belgium ˙ Bosnia and Herzegovina ˙ Bulgaria ˙ Croatia ˙ Cyprus ˙ Czech Republic.
The Mission of CERN  Push back  Push back the frontiers of knowledge E.g. the secrets of the Big Bang …what was the matter like within the first moments.
25-September-2005 Manjit Dosanjh Welcome to CERN International Workshop on African Research & Education Networking September ITU, UNU and CERN.
For EGI/EUDAT EMBL/ELIXIR use-cases Tony Wildish
European Life Sciences Infrastructure for Biological Information EGI 2015, Lisbon, 18 May 2015 Rafael C Jimenez, ELIXIR CTO ELIXIR.
Realising MRC’s Vision in Health and Bioinformatics MRC Open Council Meeting July 2014 Janet Valentine Head of Population Health and Informatics.
European Life Sciences Infrastructure for Biological Information ELIXIR Cloud Roadmap Chairs: Steven Newhouse, EMBL-EBI & Mirek Ruda,
European Life Sciences Infrastructure for Biological Information ELIXIR’s needs from the EOSC Steven Newhouse, EMBL-EBI Part of the.
Table 1. Numbers and rates of TB cases per population by country and year, EU/EEA, 2010–2014 ASR: age-standardised rate, C: case-based Source:
Distributed Computing Infrastructures for e-Science: Future Perspectives EGI Technical Forum 2012 The Clarion Congress Hotel, Freyova 945/33, Prague 18.
Atos in a nutshell CEO: Thierry Breton, since 2009 $12bn annual revenue employees in 72 countries Among Top 7 IT Service Providers WW #2 in ITO.
The Role of the Rectors’ Conferences in Europe Henriette Stöber Central European University & University of York Erasmus Mundus MAPP - Master of Public.
ECEE Enabling Clouds for eScience The open collaboration spot for cloud projects in Europe Enabling Clouds for eScience.
Table 1. Number and rate of reported confirmed syphilis cases per population by country and year, EU/EEA, 2010–2014 ASR: age-standardised rate,
Table 1. Number and rate of Legionnaires’ disease cases per population by country and year, EU/EEA, 2010–2014 ASR: age-standardised rate, C: case-based.
Transforming Science Through Data-driven Discovery Workshop Overview Ohio State University MCIC Jason Williams – Lead, CyVerse – Education, Outreach, Training.
EGI-InSPIRE RI EGI Compute and Data Services for Open Access in H2020 Tiziana Ferrari Technical Director, EGI.eu
Best Sustainable Development Practices for Food Security UV-B radiation: A Specific Regulator of Plant Growth and Food Quality in a Changing Climate The.
Tax Policy Challenges in a Changing World. Unintended Consequences of Tax Rob Marston, “Window Tax”, 1 September 2010 uploaded via Flickr, creative commons.
Accessing the VI-SEEM infrastructure
Tools and Services Workshop
Joslynn Lee – Data Science Educator
ELIXIR: Potential areas for collaboration with e-Infrastructures
European Molecular Biology Laboratory
EMBL – European Molecular Biology Laboratory
ELIXIR: Authentication and Authorization Infrastructure Requirements
The European Parliament – voice of the people
EUROPEAN PERIOPERATIVE Safe Surgery Saves Lives
EU: First- & Second-Generation Immigrants
Bridging Open Scholarship with Humanities
Presentation transcript:

EMBL-EBI and Bioinformatics Steven Newhouse, Head of Technical Services, EMBL-EBI

The European Molecular Biology Laboratory Grenoble Structural biology Hinxton, Cambridge Bioinformatics Hamburg Structural biology Heidelberg Basic research Administration EMBO EMBL staff: 1700 people >60 nationalities EMBL staff: 1700 people >60 nationalities Mouse biology Monterotondo, Rome

EMBL member states Austria, Belgium, Croatia, Czech Republic, Denmark, Finland, France, Germany, Greece, Iceland, Ireland, Israel, Italy, Luxembourg, the Netherlands, Norway, Portugal, Spain, Sweden, Switzerland and the United Kingdom Associate member states: Argentina, Australia

EMBL-EBI MISSION To provide freely available data and bioinformatics services to all facets of the scientific community in ways that promote scientific progress

European Bioinformatics Institute (EBI) International, non-profit research institute Europe’s hub for biological data services and research 570 members of staff from 53 nations Funded primarily by member states and research bodies (EC, USA, UK, Wellcome Trust)

What is bioinformatics? The science of storing, retrieving and analysing large amounts of biological information An interdisciplinary science involving: biologists biochemists computer scientists mathematicians.

EBI Provides Services and Data Resources Data Resources Public and Managed Access Individual sequence or bulk download Services Web & programmatic access for common tools Run ‘jobs’ on EMBL-EBI hardware Volume and variety of genomic data expanding EMBL-EBI data doubling every year - replication is challenging Infrastructure currently 50,000 CPUs & 46PB

Data Growth: A Community Challenge

Has sequenced 2504 individuals from 26 populations around the globe Established a reference human haplotype structure  ~80M variants with phased genotypes for the 2504 individuals Defined standards for variation data sequencing, storage and analysis The data is completely open and is available from ftp://ftp.1000genomes.ebi.ac.uk/vol1/ftp/

FTP site: ftp://ftp.1000genomes.ebi.ac.uk/vol1/ftp/ftp://ftp.1000genomes.ebi.ac.uk/vol1/ftp/  Raw Data Files  Accessible by Aspera  Accessible by Globus Grid FTP AWS Amazon Cloud:  FTP mirror Web site:  Release Announcements  Documentation Ensembl Style Browser:  Browse 1000 Genomes variants in Genomic Context  Variant Effect Predictor  Data Slicer  Other Tools

The International Genome Sample Resource (IGSR), a Wellcome Trust funded project that will be built on the foundation of the 1000 Genomes Project starts in IGSR plans to: Maintain the existing 1000 genomes data and move to GRCh38 Collect other data sets generated on the Coriell Cell Lines including Geuvadis Add new populations to expand the global diversity of the variant catalog

London Phenome Centres Gaining insights into clinical and epidemiological questions Led by Jeremy Nicholson, Imperial College Clinical Phenome Centre at St Mary’s How the metabolome of individual patients changes during the patient journey in the hospital Contribute data into MetaboLights data EBI Issues with handling and analysing this data Goal: More individualised response to the patients Lead to better more cost-effective outcomes

Personalised Medicine Reduced sequencing costs enable new techniques Past: 13 yrs & £2Bn Now: 2 days & £1000 Clinical integration raises privacy & ethical concerns UK: 100K Genomes Link to medical records

The Challenge Facing EMBL- EBI Current ‘Software as a Service’ model needs optimisation Web and programmatic access to services (3M unique users) Need to support complex analysis scenarios Access to both public and managed access data sets Bespoke workflows and tools across a variety of domains Hard for users to replicate data sets for local analysis ‘Infrastructure as a Service’ brings local analysis to EMBL- EBI

Embassy Cloud Work directly with EMBL-EBI data High bandwidth Low latency Robust, secure environment Not in competition with commercial cloud services Other cloud initiatives: ELIXIR-facing cloud support HELIX Nebula

Embassy Cloud Concept Virtualised EMBL-EBI Hardware Public Data Managed DataPublic Services Embassy Cloud 1Embassy Cloud 2 Embassy Cloud 3 Private Data PanCancer

Typical Uses Web Application Hosting Limited need for resources & VMs CTTV: Host intranet, databases, … Data Staging Undertake submission from local machine (following data staging) rather from remote location BRAEMBL: Remote submission unreliable due to file upload Data Analysis Large scale management and analysis of data PanCancer: 1,000 cores, 2.5 TB RAM, 0.5 PB HDD

Courtesy Marco Manca

Summary EMBL-EBI is the source of life-science data in Europe Variety of technical resources for collaborators Provided by the Technical Services cluster Open for collaboration across all data resources Submitting data New data analysis techniques New approaches to accessing and manipulating data Contact: