E-BIOGENOUEST: A REGIONAL LIFE SCIENCES INITIATIVE FOR DATA INTEGRATION Datacite Annual Conference 2014 - Nancy Olivier Collin – IRISA/INRIA

Presentation transcript:


Agenda
- Context
- Biogenouest
- Biology
- The e-biogenouest project
- “Bridging data, metadata and computation”
- A system of systems: collaborative portal, metadata management environment, data analysis portal

Biogenouest
Biogenouest is a network bringing together technological core facilities dedicated to Life and Environmental Sciences in the West of France.

Biogenouest
Created in 2002, Biogenouest coordinates 31 technological core facilities based in the regions of Brittany and Pays de la Loire, with the aim of organizing and pooling interregional resources. Biogenouest also federates 70 research units involved in thematic research covering 4 areas of activity: Marine resources, Agri-food, Health and Bioinformatics.

GenOuest: Bioinformatics core facility
- Member of the Biogenouest network
- Member of the IFB: French Bioinformatics Institute
- National recognition: IBiSA platform
- Regional strategic facility for INRA (National Institute of Agronomical Research)
- ISO 9001:2008 certified
- Established since to 12 people
- Computing infrastructure, storage, software development, expertise, R&D projects

[Diagram: GenOuest R&D projects. Themes: Computation, Data, Workflows, Portals, Collaboration. Items: Grid, Cloud, Cluster, BioMAJ, SeqCrawler, MetaData, EMME, HubZero, Galaxy, Mobyle, Mobyle2, Ontologies, Biosciences. A second version of the slide adds E-Biogenouest.]

Context
Kahn. On the future of genomic data. Science (2011) vol. 331 (6018)
- Now: Genomics: Next Generation Sequencing
- Next: Proteomics
- Next: Bio-imaging
- Digital data
- Huge amount
- Heterogeneous
- Critical situation for some laboratories

E-BIOGENOUEST

E-Biogenouest
- Started in May 2012 for 3 years
- Funded by Brittany and Pays de la Loire
- E-science initiative for the Biogenouest network
- Community building
- Training/workshops
- Roadmap preparation
- Experimentation/Pilot project: Virtual Research Environment (VRE)

A system of systems
Combination of various tools
- A data analysis portal: Galaxy
- A metadata management tool: ISAtools suite
- A collaborative portal: HubZero
- Additional utilities: Pydio (file transfer)
- Some software glue to make it work: BioBlend (Galaxy API), in-house developments
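The "software glue" role that the slide gives to BioBlend can be illustrated with a minimal sketch. This is not the project's in-house code: the function names, the history name and the file paths are hypothetical, and the URL and API key would have to come from an actual Galaxy server. Only `GalaxyInstance`, `histories.create_history` and `tools.upload_file` are BioBlend calls.

```python
from pathlib import Path
from typing import Iterable


def connect(url: str, api_key: str):
    """Open a Galaxy API session (hypothetical helper).

    BioBlend is imported lazily so that push_files() below can be
    exercised without the library or a running server.
    """
    from bioblend.galaxy import GalaxyInstance
    return GalaxyInstance(url=url, key=api_key)


def push_files(gi, history_name: str, paths: Iterable[str]) -> str:
    """Create a Galaxy history, upload each file into it, return the history id."""
    history = gi.histories.create_history(name=history_name)
    for path in paths:
        # upload_file(path, history_id) sends one local file to the history
        gi.tools.upload_file(str(Path(path)), history["id"])
    return history["id"]
```

Against a real server this would be called as `push_files(connect(url, key), "my experiment", ["reads.fastq"])`, with all three arguments supplied by the user.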

Galaxy portal
Galaxy: a web-based portal for biomedical data analysis
- Intuitive interface
- Workflows
- 800 tools (transcriptomics, population genetics, quantitative genetics, metagenomics, proteomics, etc.)
Giardine B, Riemer C, Hardison RC, Burhans R, Elnitski L, Shah P, Zhang Y, Blankenberg D, Albert I, Taylor J, Miller W, Kent WJ, Nekrutenko A. "Galaxy: a platform for interactive large-scale genome analysis." Genome Research Oct; 15(10):

ISAtools Suite
- Open Source tools for experimental metadata management
- Enforces the description of experiments with standards or ontologies
- Creates local repository
- Allows publication to public repositories
- EMME = ISAtools suite + additional developments and auxiliary tools
Rocca-Serra, P. et al. ISA software suite: supporting standards-compliant experimental annotation and enabling curation at the community level. Bioinformatics 26, 2354–6 (2010).

EMME
[Diagram labels: Wet Lab, Experiment, Data, MetaData, IsaTools, ISAtab files, ISAarchive, Link to raw data]
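The "Link to raw data" in this slide can be illustrated with a short sketch. It assumes the link is carried by the `Raw Data File` column of a tab-delimited ISA-Tab assay table; the function name and the sample table are hypothetical and are not taken from EMME.

```python
import csv
import io
from typing import List


def raw_data_files(assay_table: str) -> List[str]:
    """Return the distinct 'Raw Data File' values of an ISA-Tab assay table, in order."""
    reader = csv.reader(io.StringIO(assay_table), delimiter="\t")
    header = next(reader, [])
    # The column may appear more than once in a table, so collect every index
    columns = [i for i, name in enumerate(header) if name == "Raw Data File"]
    found: List[str] = []
    for row in reader:
        for i in columns:
            if i < len(row) and row[i] and row[i] not in found:
                found.append(row[i])
    return found


# Hypothetical two-sample assay table, for illustration only
EXAMPLE = (
    "Sample Name\tAssay Name\tRaw Data File\n"
    "sample1\tassay1\treads_1.fastq.gz\n"
    "sample2\tassay2\treads_2.fastq.gz\n"
)
print(raw_data_files(EXAMPLE))
```

Using the `csv` module rather than a plain `split("\t")` keeps quoted header names and values intact.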

EMME
[Diagram labels: Wet Lab, Experiment, Data, MetaData, ISAarchive, Galaxy, Import, Decompress, Import, Data Analysis]
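The import and decompress steps named on this slide might look like the sketch below. It assumes the ISAarchive is a zip file and relies on the ISA-Tab naming convention (`i_`, `s_` and `a_` prefixes for investigation, study and assay files); the function name and the returned grouping are hypothetical, not the project's implementation.

```python
import tempfile
import zipfile
from pathlib import Path


def unpack_isa_archive(archive_path, dest_dir):
    """Extract a zip archive and sort its files into ISA-Tab metadata and data files."""
    with zipfile.ZipFile(archive_path) as archive:
        archive.extractall(dest_dir)

    groups = {"investigation": [], "study": [], "assay": [], "data": []}
    prefixes = {"i_": "investigation", "s_": "study", "a_": "assay"}
    for path in sorted(Path(dest_dir).rglob("*")):
        if not path.is_file():
            continue
        kind = "data"
        if path.suffix == ".txt":
            # ISA-Tab metadata files are .txt files with a two-character prefix
            kind = prefixes.get(path.name[:2], "data")
        groups[kind].append(path)
    return groups
```

The paths collected under `"data"` are the ones that would then be sent to Galaxy for analysis, for example through the BioBlend API.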

HubZero
Scientific web portal
- Collaboration: wiki, blog, etc.
- Resources: results, articles, presentations, etc.
- Lightweight project management
M. McLennan, R. Kennell, "HUBzero: A Platform for Dissemination and Collaboration in Computational Science and Engineering," Computing in Science and Engineering, 12(2), March/April, 2010

Continuum
Continuum for the management and analysis of biological data
[Diagram labels: Collaborative environment, HubZero, Galaxy, EMME]

VRE: Virtual Research Environment
- Data: versioning, provenance, security, sharing
- Workflows: versioning, provenance, security, sharing
- Web portal: project management, collaboration, dissemination
- Data infrastructure
- Computing infrastructure

A paradigm shift
[Diagram: two arrangements of Data and IT Environment, labeled From… and To…]

Next steps
What we learned:
- Acceptance / adoption issues are key issues
What we will do:
- Switch to a production environment
- Identity federation
- ISA-Dataflow: metadata for bioinformatics workflows
What we need to do:
- To connect to other initiatives
- To define the perimeter: Big changes for bioinformatics facilities

Conclusion
- Biology becomes a digital science
- New technologies with lower costs create a dangerous situation
- A system of systems: « metadata + collaborative tool + analysis portal »
- Continuum: data centered philosophy
- « Bring back Biology to the biologist »

Questions?