MyGrid: Personalised e-Biology on the Grid Professor Carole Goble Contact

Slides:



Advertisements
Similar presentations
OMV Ontology Metadata Vocabulary April 10, 2008 Peter Haase.
Advertisements

Delivering User Needs: A middleware perspective Steven Newhouse Director.
Abstraction Layers Why do we need them? –Protection against change Where in the hourglass do we put them? –Computer Scientist perspective Expose low-level.
OMII-UK Steven Newhouse, Director. © 2 OMII-UK aims to provide software and support to enable a sustained future for the UK e-Science community and its.
Terminologies: An e-Science perspective Nicholas Gibbins Intelligence, Agents, Multimedia University of Southampton.
ASCR Data Science Centers Infrastructure Demonstration S. Canon, N. Desai, M. Ernst, K. Kleese-Van Dam, G. Shipman, B. Tierney.
Principles of Personalisation of Service Discovery Electronics and Computer Science, University of Southampton myGrid UK e-Science Project Juri Papay,
ISWC 2005, Galway Seven Bottlenecks to Workflow Reuse and Repurposing Antoon Goderis Ulrike Sattler Phillip Lord Carole Goble University of Manchester.
IBM Watson Research © 2004 IBM Corporation BioHaystack: Gateway to the Biological Semantic Web Dennis Quan
Simon Woodman Hugo Hiden Paul Watson Jacek Cala. Outline 1. What is e-Science Central? 2. Architecture and Features 3. Workflows and Applications.
GOAT: The Gene Ontology Annotation Tool Dr. Mike Bada Department of Computer Science University of Manchester
GADA Workshop 1-2 November 2005 Life Science Grid Middleware in a More Dynamic Environment Milena Radenkovic & Bartosz Wietrzyk The University of Nottingham,
On the Use of Agents in a BioInformatics Grid with slides from Luc Moreau, University of Southampton,UK myGrid.
Introduction to Web services MSc on Bioinformatics for Health Sciences May 2006 Arnaud Kerhornou Iván Párraga García INB.
The my Grid project aims to provide middleware layers that make the Information Grid appropriate for the needs of bioinformatics. my Grid is building high.
Metadata in my Grid: Finding Services for in silico Science Dr Katy Wolstencroft myGrid University of Manchester.
Provenance in my Grid Jun Zhao School of Computer Science The University of Manchester, U.K. 21 October, 2004.
© 2006 Open Grid Forum Geoffrey Fox GFSG Meeting CWI Amsterdam December OGF eScience Function.
Deciding Semantic Matching of Stateless Services Duncan Hull †, Evgeny Zolin †, Andrey Bovykin ‡, Ian Horrocks †, Ulrike Sattler † and Robert Stevens †
Database Taskforce and the OGSA-DAI Project Norman Paton University of Manchester.
CHESS seminar July 2005 Promoting reuse and repurposing on the Semantic Grid Antoon Goderis University of Manchester, UK CHESS seminar, 19 July 2005.
Taverna and my Grid Basic overview and Introduction Tom Oinn
Designing, Executing, Reusing and Sharing Workflows: Taverna and myExperiment Supporting the in silico Experiment Life Cycle Katy Wolstencroft Paul Fisher.
Taverna and my Grid Open Workflow for Life Sciences Tom Oinn
MyGrid: Personalised e-Biology on the Grid Professor Carole Goble Contact e-Science.
My Grid: Upper level Grid Services for the Bioinformatican Prof. Carole Goble Sun Microsystems BioGrid Symposium, Baltimore, USA.
E-Science Tools For The Genomic Scale Characterisation Of Bacterial Secreted Proteins Tracy Craddock, Phillip Lord, Colin Harwood and Anil Wipat Newcastle.
Integrating BioMedical Text Mining Services into a Distributed Workflow Environment Rob Gaizauskas, Neil Davis, George Demetriou, Yikun Guo, Ian Roberts.
Contact person: Prof. M. Niezgódka Prof. Piotr Bała ICM Interdisciplinary Centre for Mathematical and Computational Modelling Warsaw University,
Taverna Workflows for Systems Biology Katy Wolstencroft School of Computer Science University of Manchester.
Tom Oinn, In general a grid system is, or should be : “A collection of a resources able to act collaboratively in pursuit of an overall.
Service - Oriented Middleware for Distributed Data Mining on the Grid ,劉妘鑏 Antonio C., Domenico T., and Paolo T. Journal of Parallel and Distributed.
Professor Carole Goble
©Ferenc Vajda 1 Semantic Grid Ferenc Vajda Computer and Automation Research Institute Hungarian Academy of Sciences.
Anil Wipat University of Newcastle upon Tyne, UK A Grid based System for Microbial Genome Comparison and analysis.
Capture, integration, and sharing of functional genomic data Steve Oliver Professor of Genomics School of Biological Sciences University of Manchester.
Grids - the near future Mark Hayes NIEeS Summer School 2003.
Quality views: capturing and exploiting the user perspective on data quality Paolo Missier, Suzanne Embury, Mark Greenwood School of Computer Science University.
Grid Computing & Semantic Web. Grid Computing Proposed with the idea of electric power grid; Aims at integrating large-scale (global scale) computing.
Workflow in Grid Systems Workshop Dave Berry, Research Manager UK National e-Science Centre GGF10, Mar 2004.
Data and the UK e-Science Programme Paul Watson Director North-East Regional e-Science Centre School of Computing Science University of.
CaliBayes and BASIS: e-Science applications for Systems Biology research Yuhui Chen Institute for Ageing and Health Centre for Integrated Systems Biology.
MyGrid: open knowledge based high level services for bioinformatics the information Grid Professor Carole Goble University of Manchester, UK
Association of variations in I kappa B-epsilon with Graves' disease using classical and my Grid methodologies Peter Li School of Computing Science University.
GGF Summer School 24th July 2004, Italy Part 2: Architecture overview Professor Carole Goble University of Manchester
ICCS WSES BOF Discussion. Possible Topics Scientific workflows and Grid infrastructure Utilization of computing resources in scientific workflows; Virtual.
Infrastructures for Social Simulation Rob Procter National e-Infrastructure for Social Simulation ISGC 2010 Social Simulation Tutorial.
GRID Overview Internet2 Member Meeting Spring 2003 Sandra Redman Information Technology and Systems Center and Information Technology Research Center National.
ACGT: Open Grid Services for Improving Medical Knowledge Discovery Stelios G. Sfakianakis, FORTH.
Cooperative experiments in VL-e: from scientific workflows to knowledge sharing Z.Zhao (1) V. Guevara( 1) A. Wibisono(1) A. Belloum(1) M. Bubak(1,2) B.
Enabling e-Research in Combustion Research Community T.V Pham 1, P.M. Dew 1, L.M.S. Lau 1 and M.J. Pilling 2 1 School of Computing 2 School of Chemistry.
Remarks on OGSA and OGSI e-Science All Hands Meeting September Geoffrey Fox, Indiana University.
Utility Computing: Security & Trust Issues Dr Steven Newhouse Technical Director London e-Science Centre Department of Computing, Imperial College London.
PharmaGrid 2004, Switzerland, July Part 5: Wrap Up Professor Carole Goble University of Manchester
Using DAML+OIL Ontologies for Service Discovery in myGrid Chris Wroe, Robert Stevens, Carole Goble, Angus Roberts, Mark Greenwood
A centre of expertise in digital information management Shaping the e-future? Grids, Web Services and Digital Libraries Professor Tony.
The Collaborative Semantic Grid David De Roure University of Southampton, UK
The my Grid Information Model Nick Sharman, Nedim Alpdemir, Justin Ferris, Mark Greenwood, Peter Li, Chris Wroe AHM2004, 1 September
Portals and my Grid Stefan Rennick Egglestone Mixed Reality Laboratory University of Nottingham.
1 A myGrid Project Tutorial (3) Dr Mark Greenwood University of Manchester With considerable help from Justin Ferris, Peter Li, Phil Lord, Chris Wroe and.
Welcome Grids and Applied Language Theory Dave Berry Research Manager 16 th October 2003.
MyGrid: Personalised Bioinformatics on the Information Grid Robert Stevens, Alan Robinson & Carole Goble University of Manchester & EBI, UK myGrid project.
Workflow and myGrid Justin Ferris IT Innovation Centre 7 October 2003 Life Sciences Grid GGF9.
ACGT Architecture and Grid Infrastructure Juliusz Pukacki ‏ EGEE Conference Budapest, 4 October 2007.
J. Douglas Armstrong Institute for Adaptive and Neural Computation, School of Informatics, University of Edinburgh. Bioinformatics at Edinburgh.
Taverna: A Workbench for the Design and Execution of Scientific Workflows Paul Fisher University of Manchester.
Provenance: Problem, Architectural issues, Towards Trust
1st International Conference on Semantics, Knowledge and Grid
Presentation transcript:

myGrid: Personalised e-Biology on the Grid Professor Carole Goble Contact

myGrid: Personalised e-Science on the Grid Personalised extensible environments for data-intensive in silico experiments in biology

e-Science & Biology Discovery is increasingly done in silico on results obtained from experiments using computational analysis & data repositories. A new era of collection based and simulation based science. prediction hypothesis analysis mining integration experiment results analysis mining integration

e-Science & Biology Discovery is increasingly done in silico on results obtained from experiments using computational analysis & data repositories. A new era of collection based and simulation based science. prediction hypothesis analysis mining integration experiment results curation analysis mining integration

e-Science & Biology Biology is a multi-faceted & increasingly multi-disciplinary science. Bioinformatics is an “e-Science”. –Discovery is done in silico on results obtained from experiments using a number of analysis & data resources. Molecular biology & genomics are our particular focus.

Circadian Rhythms Has anyone studied the effect of neurotransmitters on the circadian rhythms in Drosophila? How do the functions of the clusters of proteins from my experiment interrelate? What are the proteins with a particular function? Is a structure known for this protein and what other proteins have a similar structure? Can I build a homology 3D model? What is known about the homologous protein?

Information Weaving Large amounts of data & many applications. Highly heterogeneous. –Different types, algorithms, forms, implementations, communities, service providers Highly complex and inter-related. Highly volatile. Obstacles Everywhere

Descriptive knowledge

Circadian Rhythms 1.Has anyone else studied the effect of neurotransmitters on the circadian rhythms in Drosophila? 2.How do the functions of the clusters of proteins from my experiment interrelate? And what are the proteins with a particular function? 3.Is a structure known for this protein and what other proteins have a similar structure? 4.Can I build a homology 3D model? 5.What is known about the homologous protein?

E-Science Q & A Who else has asked this question & can I use/adapt their approach? –Workflow. What were the results at each stage? –Dynamic Data Repositories. When was P12345 last updated? Which BLAST did I use? –Provenance. Has PDB changed since I last ran this? –Notification Personalisation.

E-Science Q & A Who else has asked this question & can I use/adapt their approach? –Workflow. What were the results at each stage? –Dynamic Data Repositories. When was P12345 last updated? Which BLAST did I use? –Provenance. Has PDB changed since I last ran this? –Notification Personalisation.

E-Science Q & A Who else has asked this question & can I use/adapt their approach? –Workflow. What were the results at each stage? –Dynamic Data Repositories. When was P12345 last updated? Which BLAST did I use? –Provenance. Has PDB changed since I last ran this? –Notification Personalisation.

E-Science Q & A Who else has asked this question & can I use/adapt their approach? –Workflow. What were the results at each stage? –Dynamic Data Repositories. When was P12345 last updated? Which BLAST did I use? –Provenance. Has PDB changed since I last ran this? –Notification Personalisation.

E-Science Q & A Who else has asked this question & can I use/adapt their approach? –Workflow. What were the results at each stage? –Dynamic Data Repositories. When was P12345 last updated? Which BLAST did I use? –Provenance. Has PDB changed since I last ran this? –Notification Personalisation.

E-Science Q & A Who else has asked this question & can I use/adapt their approach? –Workflow. What were the results at each stage? –Dynamic Data Repositories. When was P12345 last updated? Which BLAST did I use? –Provenance. Has PDB changed since I last ran this? –Notification Personalisation. 3 54

myGrid Objectives Straightforward discovery, interoperation, fusion, sharing of data, knowledge and workflows. Explicit management of workflows. –information & processes & best practice. Improving quality of experiments & data. –provenance & propagating change. Scientific discovery is personal & global. –personalisation & collaborative working. Security, ownership -> valuable assets.

myGrid Middleware removing the obstacles myGrid Middleware

Who is myGrid for? myGrid users biologists IS specialists infrequent problem specific bioinformaticians tool builders service provider systems administrators bioinformatics tool builders

myGrid Outcomes 1.e-Scientists –Environment built on toolkits for service access, personalisation & community. –Gene function expression analysis (fly & yeast). –Annotation workbench for the PRINTS pattern database. 2.Developers –Protocols and service descriptions. – my Grid-in-a-Box developers kit of core services. –Reference implementation services & applications. –Bio services – already delivered.

myGrid Pre-Prototype Portal Bioinformatic Services Personal Repository Metadata: Ontology Workflow Enactment Metadata: Service Directory Workflow Repository Bioinformatic Services

Portal Personal Repository Meta Data: Ontology Workflow Repository Meta Data: Service Type Directory Repository Client Ontology Client Workflow Client How do the functions of the clusters of proteins from my experiment interrelate? Locating a workflow

Portal Personal Repository Meta Data: Ontology Workflow Repository Meta Data: Service Type Directory Repository Client Ontology Client Workflow Client Locating a workflow

Portal Personal Repository Meta Data: Ontology Workflow Repository Meta Data: Service Type Directory Repository Client Ontology Client Workflow Client Locating a workflow

Portal Personal Repository Meta Data: Ontology Workflow Repository Meta Data: Service Type Directory Repository Client Ontology Client Workflow Client Locating a workflow

Portal Personal Repository Meta Data: Ontology Workflow Repository Meta Data: Service Type Directory Repository Client Ontology Client Workflow Client Locating a workflow

Portal Personal Repository Meta Data: Ontology Workflow Repository Meta Data: Service Type Directory Repository Client Ontology Client Workflow Client Locating a workflow

Repos. Client Bioinformatic Services Personal Repository Workflow Enactment Service Directory 4 2 2? Provenance Data 3 Workflow Client Service Selection Client 1 Running a workflow

Repos. Client Bioinformatic Services Personal Repository Workflow Enactment Service Directory 4 2 2? Provenance Data 3 Workflow Client Service Selection Client 1 Running a workflow

Repos. Client Bioinformatic Services Personal Repository Workflow Enactment Service Directory 4 2 2? Provenance Data 3 Workflow Client Service Selection Client 1 Running a workflow

Repos. Client Bioinformatic Services Personal Repository Workflow Enactment Service Directory 4 2 2? Provenance Data 3 Workflow Client Service Selection Client 1 Running a workflow

Repos. Client Bioinformatic Services Personal Repository Workflow Enactment Service Directory 4 2 2? Provenance Data 3 Workflow Client Service Selection Client 1 Running a workflow

Repos. Client Bioinformatic Services Personal Repository Workflow Enactment Service Directory 4 2 2? Provenance Data 3 Workflow Client Service Selection Client 1 Running a workflow

Two videos Experimental pre-prototype for requirements capture. How do the functions of a cluster of proteins interrelate? The other one, with provenance and temporary parking in repository.

myGrid Stack Metadata Services Coordination Services DataWorkflowDirectory Networked Services Applications Client Framework Governance DirectoryProvenancePersonalisation Semantic Services Info. ExtractionWorkflowOntology PortalUser AgentCollaboration Data Admin

myGrid generic technologies 1.Ontologies, Protocols & APIs. 2.Database access from the Grid. Reference implementation for UK DBTF. 3.Process enactment on the Grid. 4.Provenance services. 5.Metadata services. –From Semantic Web: DAML+OIL, RDF(S). 6.Personalisation services. 7.Reference implementation of OGSA.

Converging Technologies Agents Grid Computing Web Technologies Globus, Sun Grid Engine, Condor, DS (Jini, Corba) SOAP, WSDL, UDDI, WSFL DAML+OIL, OWL, RDF(S) ACL, methodology An early adopter for OGSA

The myGrid Team Carole Goble Norman Paton Brian Warboys Stephen Pettifer Luc Moreau Dave De Roure Chris Greenhalgh Tom Rodden John Brooke Paul Watson Alan Robinson Rob Gaizauskas Robert Stevens Ian Horrocks Neil Wipat Matthew Addis Nick Sharman Rich Cawley Simon Harper Karon Mee Simon Miles Vijay Dailani Xiaojian Liu Tom Oinn Martin Senger Milena Radenkovic Kevin Glover Angus Roberts Chris Wroe Mark Greenwood Phil Lord Neil Davis Darren Marvin Justin Ferris Peter Li Nedim Alpdemir Luca Toldo Robin McEntire Anne Westcott Tony Storey Bernard Horan Paul Smart Robert Haynes

myGrid Partners m

myGrid Summary myGrid aims to develop infrastructure middleware for an e-Biologist’s workbench. The setting is bioinformatics but the results are intended to be generally applicable to e-Science. A mix of standard, vanguard and bleeding edge technologies, advanced development and (some) research. Academic & commercial partnership. myGrid project is timely & reflects a community desire to “collaborate, or die”.

myGrid: Personalised e-Science on the Grid. Professor Carole Goble Contact