INFSO-RI-031688 Enabling Grids for E-sciencE www.eu-egee.org EGEE-II Bioinformatics Activity Dr Christophe Blanchet EGEE Bioinformatics Activity Leader.

Slides:



Advertisements
Similar presentations
11 Application of CSF4 in Avian Flu Grid: Meta-scheduler CSF4. Lab of Grid Computing and Network Security Jilin University, Changchun, China Hongliang.
Advertisements

INAF experience in Grid projects C. Vuerli, G. Taffoni, V. Manna, A. Barisani, F. Pasian INAF – Trieste.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks MyProxy and EGEE Ludek Matyska and Daniel.
Jorge Gasós Grid Technologies Unit European Commission The EU e Infrastructures Programme Workshop, Beijing, June 2005.
An open source approach for grids Bob Jones CERN EU DataGrid Project Deputy Project Leader EU EGEE Designated Technical Director
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Why Grids Matter to Europe Bob Jones EGEE.
S.L.LloydATSE e-Science Visit April 2004Slide 1 GridPP – A UK Computing Grid for Particle Physics GridPP 19 UK Universities, CCLRC (RAL & Daresbury) and.
1 ALICE Grid Status David Evans The University of Birmingham GridPP 16 th Collaboration Meeting QMUL June 2006.
B A B AR and the GRID Roger Barlow for Fergus Wilson GridPP 13 5 th July 2005, Durham.
Particle physics – the computing challenge CERN Large Hadron Collider –2007 –the worlds most powerful particle accelerator –10 petabytes (10 million billion.
Fighting Malaria With The Grid. Computing on The Grid The Internet allows users to share information across vast geographical distances. Using similar.
Andrew Meade School of Biological Sciences.
Computing for LHC Dr. Wolfgang von Rüden, CERN, Geneva ISEF students visit CERN, 28 th June - 1 st July 2009.
CBPF J. Magnin LAFEX-CBPF. Outline What is the GRID ? Why GRID at CBPF ? What are our needs ? Status of GRID at CBPF.
INFSO-RI Enabling Grids for E-sciencE The EGEE project Fabrizio Gagliardi Project Director EGEE CERN, Switzerland Research Infrastructures.
Enabling Grids for E-sciencE INFSO-RI GPSA grid portal for Bioinformatics, EGEE3 Athens, 20/04/ GPSA - Grid Protein Sequence Analysis on the.
An overview of the EGEE project Bob Jones EGEE Technical Director DTI International Technology Service-GlobalWatch Mission CERN – June 2004.
INFSO-RI Enabling Grids for E-sciencE WISDOM mini-workshop Vincent Breton (CNRS-IN2P3, LPC Clermont-Ferrand) ISGC 2007 March 28th,
EU 2nd Year Review – 04 Feb – WP10 status report – n° 1 WP10 Status Report Vincent Breton (WP10 manager) Presentation address.
Bioinformatics: a Multidisciplinary Challenge Ron Y. Pinter Dept. of Computer Science Technion March 12, 2003.
The LHC Computing Grid – February 2008 The Worldwide LHC Computing Grid Dr Ian Bird LCG Project Leader 15 th April 2009 Visit of Spanish Royal Academy.
Enabling Grids for E-sciencE Medical image processing web portal : Requirements analysis. An almost end user point of view … H. Benoit-Cattin,
FP6−2004−Infrastructures−6-SSA E-infrastructure shared between Europe and Latin America Santiago de Chile, 1st EELA Conference, 4-5/9/06 1 Status.
Portals and Credentials David Groep Physics Data Processing group NIKHEF.
SICSA student induction day, 2009Slide 1 Social Simulation Tutorial Session 6: Introduction to grids and cloud computing International Symposium on Grid.
EGEE (EU IST project) – Networking Activities 4: biomedical applications NA4 Biomedical Activities Johan Montagnat First EGEE Meeting Cork,
GRACE Project IST EGAAP meeting – Den Haag, 25/11/2004 Giuseppe Sisto – Telecom Italia Lab.
IST E-infrastructure shared between Europe and Latin America Biomedical Applications in EELA Esther Montes Prado CIEMAT (Spain)
A short introduction to the Worldwide LHC Computing Grid Maarten Litmaath (CERN)
Enabling Grids for E-sciencE ENEA and the EGEE project gLite and interoperability Andrea Santoro, Carlo Sciò Enea Frascati, 22 November.
BIOINFOGRID: Bioinformatics Grid Application for Life Science Giorgio Maggi INFN and Politecnico di Bari
INFSO-RI Enabling Grids for E-sciencE EGEE - a worldwide Grid infrastructure opportunities for the biomedical community Bob Jones.
INFSO-RI Enabling Grids for E-sciencE V. Breton, 30/08/05, seminar at SERONO Grid added value to fight malaria Vincent Breton EGEE.
Report from the EELA External Advisory Committee V. Breton, F. Gagliardi, M. Kunze 5/9/2006.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks EGEE – paving the way for a sustainable infrastructure.
EMBRACE An example of Grid Integration (I): The EMBRACE project Jean SALZEMANN CNRS/IN2P3.
1 Large-Scale Profile-HMM on the Grid Laurent Falquet Swiss Institute of Bioinformatics CH-1015 Lausanne, Switzerland Borrowed from Heinz Stockinger June.
Bob Jones Technical Director CERN - August 2003 EGEE is proposed as a project to be funded by the European Union under contract IST
INFSO-RI Enabling Grids for E-sciencE Status of the Biomedical Applications in EELA Project (E-Infrastructures Shared Between Europe.
INFSO-RI Enabling Grids for E-sciencE Biomedical applications V. Breton, CNRS-IN2P3.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Next steps with EGEE Gergely Sipos
INFSO-RI Enabling Grids for E-sciencE In silico docking on EGEE infrastructure, the case of WISDOM Nicolas Jacq LPC of Clermont-Ferrand,
EGEE-II INFSO-RI Enabling Grids for E-sciencE WISDOM in EGEE-2, biomed meeting, 2006/04/28 WISDOM : Grid-enabled Virtual High Throughput.
Les Les Robertson LCG Project Leader High Energy Physics using a worldwide computing grid Torino December 2005.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE08 conference, Istambul Biomed community meeting V. Breton, CNRS.
INFSO-RI Enabling Grids for E-sciencE EGEE Review WISDOM demonstration Vincent Bloch, Vincent Breton, Matteo Diarena, Jean Salzemann.
B i o i n f o r m a t i c s / B i o m e d i c a l A p p l i c a t i o n s i n E E L A Mexico, D.F., october 22 – 26, e – s c i e n c e M e x i c.
BIOINFOGRID: Bioinformatics Grid Application for life science MILANESI, Luciano National Research Council Institute of.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Web interface for Protein Sequence.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks BiG: A Grid Service to Distribute Large BLAST.
Enabling Grids for E-sciencE ITC-irst for NA4 biomed meeting at EGEE conference: Ginevra 2006 BioDCV - Features 1.Application for analysis of microarray.
Swiss Grid Day, Bern, November 26 th Human migrations Adapted from Cavalli-Sforza & Feldman, 2003 [12,000] [55,000] Homo sapiens sapiens.
INFSO-RI Enabling Grids for E-sciencE EGEE-2 NA4 Biomed Bioinformatics in CNRS Christophe Blanchet Institute of Biology and Chemistry.
INFSO-RI Enabling Grids for E-sciencE The EGEE Project Owen Appleton EGEE Dissemination Officer CERN, Switzerland Danish Grid Forum.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE08 conference, Istambul Life sciences cluster perspective on EGI V. Breton, CNRS On.
EGEE is a project funded by the European Union under contract IST Enabling bioinformatics applications to.
EU 2nd Year Review – 04 Feb – WP10 status report – n° 1 WP10 Status Report Vincent Breton (WP10 manager) Presentation address.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Bioinformatics activity Christophe BLANCHET.
Milanesi Luciano Catania, Italy 13/03/2007 Bioinformatics challenges in European projects in Grid. Milanesi Luciano National Research Council Institute.
EGEE is a project funded by the European Union under contract IST Aims and organization of the Biomedical VO Yannick Legré CNRS/IN2P3 NA4/SA1.
Bioinformatics Grid Application for Life Science. COMMUNICATION NETWORK DEVELOPMENT SPECIFIC SUPPORT ACTION BIOINFOGRID Andreas Gisel & Luciano Milanesi.
1 The Life-Science Grid Community Tristan Glatard 1 1 Creatis, CNRS, INSERM, Université de Lyon, France The Spanish Network for e-Science 2/12/2010.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Service to Encrypt Biological Data on Grid.
Bob Jones EGEE Technical Director
Grid Computing: Running your Jobs around the World
The LHC Computing Grid Visit of Mtro. Enrique Agüera Ibañez
MATLAB Distributed, and Other Toolboxes
Long-term Grid Sustainability
EGEE support for HEP and other applications
SA1 ROC Meeting Bologna, October 2004
Presentation transcript:

INFSO-RI Enabling Grids for E-sciencE EGEE-II Bioinformatics Activity Dr Christophe Blanchet EGEE Bioinformatics Activity Leader CNRS IBCP, Lyon, France

Enabling Grids for E-sciencE INFSO-RI EGEE Bioinformatics Activity, Dr. C. Blanchet (Resp.) 2 The EU EGEE grid project EGEE-I –1 April 2004 – 31 March 2006 –71 partners in 27 countries, federated in regional Grids EGEE-II –1 April 2006 – 31 March 2008 –91 partners in 32 countries –13 Federations Objectives –Large-scale, production-quality infrastructure for e-Science –Improving and maintaining gLite Grid middleware –Attracting new resources and users from industry as well as science Size of the infrastructure (Sept. EGEE06): 192 sites in 40 countries ~ CPU ~ 5 PB disk, + tape MSS

Enabling Grids for E-sciencE INFSO-RI EGEE Bioinformatics Activity, Dr. C. Blanchet (Resp.) Grid EGEE06

Enabling Grids for E-sciencE INFSO-RI EGEE Bioinformatics Activity, Dr. C. Blanchet (Resp.) 4 EGEE06 Capitalising on e-Science to make e-Business

Enabling Grids for E-sciencE INFSO-RI EGEE Bioinformatics Activity, Dr. C. Blanchet (Resp.) 5 Applications Many applications from a growing number of domains –Astrophysics –Computational Chemistry –Earth Sciences –Financial Simulation –Fusion –Geophysics –High Energy Physics –Life Sciences –Multimedia –Material Sciences –… Application Identification and Support (NA4) –25 countries, 40 partners, 280+ participants, 1000s of users Support the large and diverse EGEE user community: –Promote dialog: Users Forums & EGEE Conferences –Technical Aid: Porting code, procedural issues –Liaison: Software and operational requirements

Enabling Grids for E-sciencE INFSO-RI EGEE Bioinformatics Activity, Dr. C. Blanchet (Resp.) 6 Biomedical Virtual Organization Biomed VO Management –Leader: V. Breton –Deputies: C. Blanchet & J. Montagnat –~80 participants Three active subgroups –Bioinformatics (C. Blanchet) Details in next slides –Drug discovery (V.Breton) Successful runs for malaria and avian flu virus. Similar work to be done for neglected diseases in EGEE-II. WISDOM: 1 October - 1 December, 500 CPU-years, 5 TB, Discussions underway for finalizing docking targets –Medical imaging (J. Montagnat) Kickoff meeting on July 12 in Sophia Antipolis Three application services offered from partners (MDM, Moteur, P-grade) 6 applications from EGEE, 5 new in EGEE-II Infrastructure (Dec 2006) –Computing: 113 CEs, ~15,000 CPUs –Storage: 107 SEs, ~3,5 TB –~1000 jobs/day

Enabling Grids for E-sciencE INFSO-RI EGEE Bioinformatics Activity, Dr. C. Blanchet (Resp.) 7 Bioinformatics Activity The bioinformatics sector targets gene and protein analysis, for example genomics, proteomics and phylogeny; but also system biology, genetic linkage, genetic demographic model, … Integrate biological data and tools –Select relevant applications with real grid add-value –Define and prioritize their requirements –Give feedback about satisfaction with middleware components Promote Dialog –Internal Bioinformatics meetings –Participation to other EGEE activity meetings –Scientific dissemination in national and international conferences –Collaboration with related projects: EU-EMBRACE, EU-EELA, EU- BIOINFOGRID, EU-ETICS, SwissBioGrid. Joint meetings with related projects Grid expertise –Consulting about EGEE grid platform –Help on porting bioinformatics applications –Train users and developers –Support, helpdesk Deploy applications on the production platform –10 applications –4 applications from EGEE, 6 new ones in EGEE-II –Training, collaboration with Regional Operation Center, –Add new resources: hardware and human –Give feed-back of services use.

Enabling Grids for E-sciencE INFSO-RI EGEE Bioinformatics Activity, Dr. C. Blanchet (Resp.) 8 NoE EMBRACE (EU FP6) « A European Model for Bioinformatics Research and Community Education » –simplify and standardize the way in which biological information is served to the researchers who use it. –Integrating biological data and bioinformatics tools in grid Network of Excellence ( ) –From Feb 1st, 2005 –partners: EBI (PI), EMBL, SIB, CNRS, MPI_MG, INRA, ITB CNR, CNB,... Funded by the European Union (EU-FP6, LHSG-CT ) –EMBRACE uses a test problem driven development method. The services will be developed through a set of test problems, which will use tasks from real biological research, designed to stretch the system in critical ways

Enabling Grids for E-sciencE INFSO-RI EGEE Bioinformatics Activity, Dr. C. Blanchet (Resp.) 9 Bioinformatics Applications

Enabling Grids for E-sciencE INFSO-RI EGEE Bioinformatics Activity, Dr. C. Blanchet (Resp.) 10 GPSA: Bioinformatics Grid Portal Scientific objectives –Molecular Bioinformatics: protein sequence analysis –Analyse data from high-throughput Biology: complete genome projects, EST, complete proteomes, structural biology, …. –Integration of biological data and tools Method –Provide Biologists with an usual Web interface: Web portal online since tools & 12 updated databases + 9,000,000 jobs & 5,000 jobs/day –Ease the access to updated databases and algorithms. –Protein databases are stored on grid storage as flat files. –Legacy bioinformatics applications Wrapping usual binary in grid environment transparent remote access with local filesystem –Display results in graphical Web interface. Status: Prototype

Enabling Grids for E-sciencE INFSO-RI EGEE Bioinformatics Activity, Dr. C. Blanchet (Resp.) 11 Grid-enabled bioinformatics resources –9 algorithms –3 protein databases Bioinformatics descriptors –XML framework, WSRF Encryption system: EncFile –Security: AES, Key sharing, M-of-N Transparent access to grid files: Perroquet GPSA Results

Enabling Grids for E-sciencE INFSO-RI EGEE Bioinformatics Activity, Dr. C. Blanchet (Resp.) SPLATCHE (1) SPatiaL And Temporal Coalescences in Heterogeneous Environment Scientific objectives Study human evolutionary genetics and answer questions such as the geographic origin of modern human populations, the genetic signature of expanding populations, the genetic contacts between modern humans and Neanderthals, and the expected null distributions of genetic statistics applied on genome-wide data sets. Method Simulate the past demography (growth and migrations) of human populations into a geographically realistic landscape, by taking into account the spatial and temporal heterogeneity of the environment. Generate the molecular diversity of several samples of genes drawn at any location of the current human's range, and compare it to the observed contemporary molecular diversity. SPLATCHE uses a region sampling Bayesian framework that requires10 5 independent demographic and genetic simulations.

Enabling Grids for E-sciencE INFSO-RI EGEE Bioinformatics Activity, Dr. C. Blanchet (Resp.) SPLATCHE (2) Comparison of 4 different demographic models of human evolution, using a new set of nuclear markers Results 40-min jobs are a good compromise between # of CPUs and # of jobs 2 mio simulations 4000 jobs About 80 CPU-days per try 2 tries had 0% job failure 2 other tries had about 2-3% job failures In black: relative proportion of genes brought by first farmers in various European regions (maximum=32%) Expansion of first farmers 45,000-30,000 years ago8,000-3,000 years ago Expansion of early modern Europeans Probability distribution of demographic / genetic parameter of interest (admixture, migration, growth rates, etc…) Simulation replicates on the grid

Enabling Grids for E-sciencE INFSO-RI EGEE Bioinformatics Activity, Dr. C. Blanchet (Resp.) Given the metabolic network of an organism, the application will screen high- throughput data derived from Protein-Protein Interaction and DNA array experiments in several conditions (for example a disease and a control condition) and identify important nodes in the network that show significant concentration changes: Pure Metabolic Model: e.g. from KEGG (>1400 reactions) Metabolic & Signal-Transduction pathways: e.g. from Reactome (>1500) Large-Scale Pathways Analysis (1)

Enabling Grids for E-sciencE INFSO-RI EGEE Bioinformatics Activity, Dr. C. Blanchet (Resp.) Cancer application Type-2 diabetes Melanoma celllines: [primary tumor vs. metastases (tumor progression) vs. control] Homepage of ESBIC-D (EU project): Nutrigenomik (BioProfile – BMBF – Germany): Mouse model (NZO): [standard diet vs. high fat diet] Applications Large-Scale Pathways Analysis (2)

Enabling Grids for E-sciencE INFSO-RI EGEE Bioinformatics Activity, Dr. C. Blanchet (Resp.) 16 IN: 2-50 MB OUT: MB lfc/grid-ftp scp lfc/grid-ftp scp/lfc/grid-ftp Application for analysis of microarray and proteomics data with Support Vector Machine (SVM) classifiers; with IFOM-FIRC -- BICG AIRC project Standard LCG user interface commands are used to transfer a.Data + experiment design (setup db) lcg-cp/grid-url-copy db from local to SE b.Application edg-job-submit BioDCV.jdl (jdl file) c.Resulting db: lcg-cp/grid-url-copy db (from SE to local) BioDCV

Enabling Grids for E-sciencE INFSO-RI EGEE Bioinformatics Activity, Dr. C. Blanchet (Resp.) 17 BioDCV: Running on Microarray data Breast cancer microarray dataset genes and 183 samples 4Mega footprint units (footprint = #features x #samples) Original work in (Sotiriou et al, J. Nat Canc Inst 2006) –September/October 2006 –Used: 60 CPUs and about 40 Biomed sites. –20 CPUs x 3 series (alternative machine learning models): RFE-Linear SVM, TR (Terminated Ramp) SVM, Correlation-aware RFE-SVM. –Failure: 5 % (3 jobs) –Running times (average over 20 runs): Linear SVM ~ 5 hours TR SVM ~ 8 hours Correlation-Aware ~ 15 hours

Enabling Grids for E-sciencE INFSO-RI EGEE Bioinformatics Activity, Dr. C. Blanchet (Resp.) 18 PhyloJAVA Phylojava is a client/server tool dedicated to phylogenetic tree reconstruction. –This program allows phylogenetic tree inferences according to most usual methods (distance methods, maximum parsimony, maximum likelihood). Phylogenetic trees are computed on a remote server, and are sent via internet to a graphical interface (the client) that allows the user to handle alignments and phylogenetic trees. The user therefore only has to install the graphical interface on his computer, and can submit tree reconstruction jobs on a remote server (EGEE grid or his computers). Status: –Porting to the EGEE grid. Data sets need to be analysed –about 300 sequences of more than 6000-characters-long each. –weeks of computation with the current bootstraping algorithms

Enabling Grids for E-sciencE INFSO-RI EGEE Bioinformatics Activity, Dr. C. Blanchet (Resp.) 19 BiG (1) BLAST in Grids (BiG) –Grid Interface to MPI Blast. –Access Through a Web Portal ( –Access to EELA Grid Through Gate-to-Grid Using a WSRF Interface. FASTA File (Input Sequence) AGTACGTAGTAGCTGC TGCTACGTGGCTAGCT AGTACGTCAGACGTAG ATGCTAGCTGACTCGA FASTA File (Input Sequence) AGTACGTAGTAGCTGC TGCTACGTGGCTAGCT AGTACGTCAGACGTAG ATGCTAGCTGACTCGA Execution Parameters Execution Parameters Protein Database (Non Redunda nt e.g.) Output Matches Xxxxx x x x x x xxx xx xxx x Output Matches Xxxxx x x x x x xxx xx xxx x

Enabling Grids for E-sciencE INFSO-RI EGEE Bioinformatics Activity, Dr. C. Blanchet (Resp.) 20 BiG (2)

Enabling Grids for E-sciencE INFSO-RI EGEE Bioinformatics Activity, Dr. C. Blanchet (Resp.) 21 SuperlinkOnline Superlink online : a tool for genetic linkage analysis –Genetic Linkage Analysis is about hunting for disease-provoking genes. Tasks are automatically divided into small pieces and executed simultaneously using many computers: Executable is pre-installed, very small I/O From Level1 to L5 are from short tasks to very hard ones EGEE Biomed VO addresses L4 very hard tasks Statistics –7000 CPU-hours a day on ~3000 CPUS (Condor in Madison and Technion) –20-40 runs daily 1-3 runs of 10k jobs (15-30 min each) 5-10 runs of jobs (15 min) The rest are <30 jobs of up to 15 min –Workload will increase with new functionalities.

Enabling Grids for E-sciencE INFSO-RI EGEE Bioinformatics Activity, Dr. C. Blanchet (Resp.) 3DEM (1) Selected by their user impact MLalign3D –The combination of images in a 3D reconstruction requires: That they represent projections of identical 3D objects. That their relative orientations be known. –MLalign3D combines the tasks of Classifying the images in homogeneous groups. Aligning the images to obtain the best orientation. –MLalign3D employs a Maximum Likelihood method. MLalign2D –Is a similar to the 3D alignment case, only simpler

Enabling Grids for E-sciencE INFSO-RI EGEE Bioinformatics Activity, Dr. C. Blanchet (Resp.) 3DEM (2) DIANE/MLalign2D jobs evolution MonAlisa plot MLalign2D 37 Tasks (each a subset of 10 images)

Enabling Grids for E-sciencE INFSO-RI EGEE Bioinformatics Activity, Dr. C. Blanchet (Resp.) 24 Grid for Bioinformatics Very different applications … –Different requirements and priorities –Different resources involved: hardware, software, human –Different Life science communities addressed … but common requirements –End-users dont care of the infrastructure ! –Data Deploying updatable databases Security of biological data –Tools Integrating numerous, complex programs: automatic procedure Legacy application: grid-enabled without modification, SDJ, bundle, parallel job requiring MPI Portal and user interfaces Current major issues –Workload Management short job (< 5min): 2-3 min of overhead bundle jobs: very long time submission (12h for 4,000 jobs) –Data management: no tool in gLite to integrate database –Security: data confidentiality, encryption Portal certificate management of long authentication (proxy)

Enabling Grids for E-sciencE INFSO-RI EGEE Bioinformatics Activity, Dr. C. Blanchet (Resp.) 25 Next Bioinformatics Meetings EGEE Bioinformatics #3 –Valencia (Spain), February 2007, University of Valencia –Hosts: Vicente Hernandez, Ignacio Blanquer EGEE Bioinformatics #4 (joint with EU Bioinfogrid) –Varenna (Italy), May 2007 –Host: Luciano Milanesi EGEE Bioinformatics #5 (joint with SwissBioGrid) –Lugano (Swiss), Sept 2007 –Host: Peter Kunszt Bioinformatics meetings are standing during 2 days –One day for Internal EGEE bioinformatics activity report and discussion –One day for networking activity with external applications and projects workshop/tutorial about useful services: EGEE or 3rd-party ones

Enabling Grids for E-sciencE INFSO-RI EGEE Bioinformatics Activity, Dr. C. Blanchet (Resp.) 26 Next project meetings EGEE User Forum 2 – Manchester, UK, May 9-11, 2007 – Conjointly with OGF 20 (May 7-9) – Provide opportunities for an active dialogue between the EGEE project and its users (talks, demos, posters) – Call for Abstracts is openhttp:// EGEE07 – Budapest, Hungary, 1-5 October 2007 – Key European event dedicated to Grid technology: EGEE annual conferences are regularly attended by a large international Grid community coming together to discuss a wide range of issues, the latest developments, and international co-operation, with the aim of driving forward world-class Grid technologies.

Enabling Grids for E-sciencE INFSO-RI EGEE Bioinformatics Activity, Dr. C. Blanchet (Resp.) 27 Conclusion Bioinformatics community on EGEE Grid –10 Applications In production: Splatche, bioDCV Prototype: Porting: Large Scale Pathway, BiG, 3DEM, … –Provide expertise to port applications Benefit from EGEE grid, largest platform in production mode Collaboration with related projects: EU EMBRACE, EU EELA, EU BIOINFOGRID, SwissBioGrid. Open to new applications: contact us.

Enabling Grids for E-sciencE INFSO-RI EGEE Bioinformatics Activity, Dr. C. Blanchet (Resp.) 28 Build the Bioinformatics Grid Bioinformatics services for developers –Internal: integrate data and tools as grid services, –External: powerful interfaces: e.g. Web Services High-level interfaces for end-users –User-friendly: Web Portal for biologists, physicians –Efficient: integrated data and tools Powerful interface to display Grid-scale results –Thousands to millions of bioinformatics jobs –Graphical and Data-mining tools