AHM September 2004 Grid Services Supporting the Usage of Secure Federated, Distributed Biomedical Data Dr Richard Sinnott Technical Director National e-Science.

Slides:



Advertisements
Similar presentations
Dr Richard Sinnott Dr Dave Berry 5 th February 2004 National e-Science Centre Local Developments Technical Director National e-Science Centre ||| Deputy.
Advertisements

AHM 2006 September 2006 DyVOSE Project: Experiences in Applying Advanced Authorisation Infrastructures John Watt (
DyVOSE Status Report Dr Richard Sinnott Technical Director National e-Science Centre ||| Deputy Director Technical Bioinformatics Research Centre University.
Grid Engineering Experience & Biological Applications Dr Richard Sinnott Technical Director National e-Science Centre ||| Deputy Director Technical Bioinformatics.
BRIDGES Status Report Dr Richard Sinnott Technical Director National e-Science Centre ||| Deputy Director Technical Bioinformatics Research Centre University.
High Performance Computing Course Notes Grid Computing.
An overview of the EGEE project Bob Jones EGEE Technical Director DTI International Technology Service-GlobalWatch Mission CERN – June 2004.
GEODE Workshop 16 th January 2007 Issues in e-Science Richard Sinnott University of Glasgow Ken Turner University of Stirling.
EDINA 20 th March 2008 EDINA Geo/Grid - Security Prof. Richard O. Sinnott Technical Director, National e-Science Centre University of Glasgow, Scotland.
Evidence-Based Information Retrieval in Bioinformatics
Bioinformatics: a Multidisciplinary Challenge Ron Y. Pinter Dept. of Computer Science Technion March 12, 2003.
The Cell, Central Dogma and Human Genome Project.
UK e-Science and the White Rose Grid Paul Townend Distributed Systems and Services Group Informatics Research Institute University of Leeds.
Signaling Pathways and Summary June 30, 2005 Signaling lecture Course summary Tomorrow Next Week Friday, 7/8/05 Morning presentation of writing assignments.
1 July 2005© 2005 University of Kent1 Seamless Integration of PERMIS and Shibboleth – Development of a Flexible PERMIS Authorisation Module for Shibboleth.
Web-based Portal for Discovery, Retrieval and Visualization of Earth Science Datasets in Grid Environment Zhenping (Jane) Liu.
Genome database & information system for Daphnia Don Gilbert, October 2002 Talk doc at
NGS induction --- case study: the BRIDGES project Micha Bayer Grid Services Developer, BRIDGES project National e-Science Centre, Glasgow Hub.
E-Science Education Workshop, 1-2 Nov 2004 Teaching Grid Computing Dr Richard Sinnott Technical Director National e-Science Centre ||| Deputy Director.
Welcome e-Science in the UK Building Collaborative eResearch Environments Prof. Malcolm Atkinson Director 23 rd February 2004.
Bioinformatics.
Databases in Bioinformatics and Systems Biology Carsten O. Daub Omics Science Center RIKEN, Japan May 2008.
BRIDGES Status Report Dr Richard Sinnott Technical Director National e-Science Centre ||| Deputy Director Technical Bioinformatics Research Centre University.
The National e-Science Centre: Role and Activities Dave Berry, Research Manager Indo-UK Workshop on e-Science Delhi, February 2004.
Can Grids Deliver the Vision for Future Hypothesis Driven Life Science Research? Professor Richard Sinnott Technical Director National e-Science Centre.
Sys-Bio Talk, 24 th Feb 2005 Towards Grid-Based System Biology Dr Richard Sinnott Technical Director National e-Science Centre ||| Deputy Director (Technical)
The Environmental Genomics Thematic Programme Data Centre Dawn Field, Director.
Dr Richard Sinnott Technical Director National e-Science Centre ||| Deputy Director Technical Bioinformatics Research Centre University of Glasgow
Neil Geddes GridPP-10, June 2004 UK e-Science Grid Dr Neil Geddes CCLRC Head of e-Science Director of the UK Grid Operations Support Centre.
Future of e-Science Malcolm Atkinson Director 18 th March 2004.
Supporting further and higher education The Akenti Authorisation System Alan Robiette, JISC Development Group.
Portal-based Access to Advanced Security Infrastructures John Watt UK e-Science All Hands Meeting September 11 th 2008.
UK e-Science AHM th September 2005 Comparison of Data Access and Integration Technologies in the Life Science Domain Dr Richard Sinnott Technical.
JISC Middleware Security Workshop 20/10/05© 2005 University of Kent.1 The PERMIS Authorisation Infrastructure David Chadwick
Usability Talk, 26 th January 2006 Development of Usable Grid Services for the Biomedical Community Prof Richard Sinnott Technical Director National e-Science.
Grid enabled e-Research in the Life Sciences Prof Richard Sinnott Technical Director National e-Science Centre Anthony Stell University of Glasgow, Scotland,
SEEK Welcome Malcolm Atkinson Director 12 th May 2004.
Oxford University e-Science Centre 1 Managing Access 4 Dec Managing Access to Resources on the Grid 4 December 2002.
OGF22 25 th February 2008 OGF22 Demo Slides Prof. Richard O. Sinnott Technical Director, National e-Science Centre University of Glasgow, Scotland
ACGT: Open Grid Services for Improving Medical Knowledge Discovery Stelios G. Sfakianakis, FORTH.
NeSC Workshop - February /14 Study of User Priorities for e-Infrastructure for e-Research (SUPER) Steven Newhouse Jennifer Schopf Andrew Richards.
IBM & HSBC visit Malcolm Atkinson Director & e-Science Envoy UK National e-Science Centre & e-Science Institute 30 th March 2006.
Central dogma: the story of life RNA DNA Protein.
GridShib and PERMIS Integration: Adding Policy driven Role-Based Access Control to Attribute-Based Authorisation in Grids Globus Toolkit is an open source.
Authentication and Authorisation for Research and Collaboration Peter Solagna Milano, AARC General meeting Current status and plans.
1 e-Science AHM st Aug – 3 rd Sept 2004 Nottingham Distributed Storage management using SRB on UK National Grid Service Manandhar A, Haines K,
Exploring and Exploiting the Biological Maze Zoé Lacroix Arizona State University.
Dynamic Privilege Management Infrastructures Utilising Secure Attribute Exchange Dr John Watt Grid Developer, National e-Science Centre University of Glasgow.
Identity Management in DEISA/PRACE Vincent RIBAILLIER, Federated Identity Workshop, CERN, June 9 th, 2011.
European Life Sciences Infrastructure for Biological Information ELIXIR and Identity Management 2 nd Workshop on Federated Identity.
Standards driven AAA for Job Management within the OMII-UK distribution Steven Newhouse Director, OMII-UK
An Introduction to UK e-Science Anne E Trefethen Deputy Director UK e-Science Core Programme.
1 AHM, 2–4 Sept 2003 e-Science Centre GRID Authorization Framework for CCLRC Data Portal Ananta Manandhar.
An Introduction to NCBI & BLAST National Center for Biotechnology Information Richard Johnston Pasadena City College.
Supporting education and research The JISC Core Middleware Call Brian Gilmore The University of Edinburgh and JISC Committee for Support of Research.
Holding slide prior to starting show. Lessons Learned from the GECEM Portal David Walker Cardiff University
Welcome Grids and Applied Language Theory Dave Berry Research Manager 16 th October 2003.
Virtual Organisations for Trials and Epidemiological Studies (VOTES) Overview VOTES is a pioneering project investigating the application of Grid technology.
High throughput biology data management and data intensive computing drivers George Michaels.
RC ICT Conference 17 May 2004 Research Councils ICT Conference The UK e-Science Programme David Wallace, Chair, e-Science Steering Committee.
Shibboleth Use at the National e-Science Centre Hub Glasgow at collaborating institutions in the Shibboleth federation depending.
All Hands Meeting 2005 BIRN-CC: Building, Maintaining and Maturing a National Information Infrastructure to Enable and Advance Biomedical Research.
EGEE is a project funded by the European Union under contract IST Aims and organization of the Biomedical VO Yannick Legré CNRS/IN2P3 NA4/SA1.
J. Douglas Armstrong Institute for Adaptive and Neural Computation, School of Informatics, University of Edinburgh. Bioinformatics at Edinburgh.
Bob Jones EGEE Technical Director
Building Castles with Shifting Sands?
UK e-Science OGSA-DAI November 2002 Malcolm Atkinson
Grid Portal Services IeSE (the Integrated e-Science Environment)
The JISC Core Middleware Call
Presentation transcript:

AHM September 2004 Grid Services Supporting the Usage of Secure Federated, Distributed Biomedical Data Dr Richard Sinnott Technical Director National e-Science Centre ||| Deputy Director Technical Bioinformatics Research Centre University of Glasgow 3 rd September 2004

AHM September 2004 Overview of BRIDGES Biomedical Research Informatics Delivered by Grid Enabled Services (BRIDGES) NeSC (Edinburgh and Glasgow) and IBM  Supporting project for CFG project Generating data on hypertension Rat, Mouse, Human genome databases Variety of tools used BLAST, BLAT, Gene Prediction, visualisation, … Variety of data sources and formats Microarray data, genome DBs, project partner research data, medical records, … Aim is integrated infrastructure supporting Data federation Security

AHM September 2004 Grids & Life Sciences Extensive Research Community >1000 per research university Extensive Applications Many people care about them  Health, Food, Environment, … Interacts with many disciplines Physics, Chemistry, Maths/Statistics, Nano-engineering, … Huge and expanding number of databases relevant to bioinformatics community Heterogeneity, Interdependence, Complexity, Change, Dirty… Linking in co-ordinated, secure manner full of open issues to be addressed Compute demands growing as more in-silico research undertaken

AHM September 2004 Database Growth PDB Content Growth DBs growing exponentially!!! Biobliographic (MedLine, PubMed…) Amino Acid Seq (SWISS-PROT, …) 3D Molecular Structure (PDB, …) Nucleotide Seq (GenBank, EMBL, …) Biochemical Pathways (KEGG, WIT…) Molecular Classifications (SCOP, CATH,…) Motif Libraries (PROSITE, Blocks, …)

AHM September 2004 Complexity of Biological Data Nucleotide sequences Nucleotide structures Gene expressions Protein Structures Protein functions Protein-protein interaction (pathways) Cell Cell signalling Tissues Organs PhysiologyOrganisms Populations + links to plant/crops, environmental, health, … information sources

AHM September 2004 More genomes …... Arabidopsis thaliana mouse rat Caenorhabitis elegans Drosophila melanogaster Mycobacterium leprae Vibrio cholerae Plasmodium falciparum Mycobacterium tuberculosis Neisseria meningitidis Z2491 Helicobacter pylori Xylella fastidiosa Borrelia burgorferi Rickettsia prowazekii Bacillus subtilis Archaeoglobus fulgidus Campylobacter jejuni Aquifex aeolicus Thermotoga maritima Chlamydia pneumoniae Pseudomonas aeruginosa Ureaplasma urealyticum Buchnerasp. APS Escherichia coli Saccharomyces cerevisiae Yersinia pestis Salmonella enterica Thermoplasma acidophilum

AHM September 2004 Bio e-Science Projects

AHM September 2004 Bridges Project Synteny Grid Service blast + VO Authorisation Information Integrator OGSA-DAI

AHM September 2004 Grid Security OGSA security Single sign-on based on (X.509) digital certificates  establish credentials –Certification authority based (RAL in UK) Services (and clients) have APIs for fine grained security  Based on GSS-API Provides for authentication but need authorisation  Various technologies for authorisation including PERMIS, CAS, … Collaborating with P rivil E ge and R ole M anagement I nfrastructure S tandards Validation (PERMIS) team  Lead by Prof David Chadwick, University of Salford –(

AHM September 2004 Security Authorisation PERMIS allows to Define roles for who can do what on what  Policy = { Role x Target x Action } –Can user X invoke service Y and access or change data Z? »Policies created with PERMIS PolicyEditor (output is XML based policy)

AHM September 2004 Security Authorisation PERMIS Privilege Allocator then used to sign policies Associates roles with specific users  Policies stored as attribute certificates in LDAP server When is authorisation done? Two main choices  Portal personalised for users based on their policies –If not allowed to invoke service then they do not get to see it  Actions of users (with given role) are authorised every time the service is invoked –They can see the service but potentially not be allowed to invoke it »Performance issues… but more likely scenario for authorisation  In both cases, if not explicitly agreed in policy then rejected and logged! –Both cases being explored Plan to exploit the GGF SAML AuthZ specification  Based on GT3.3 – currently have BLAST service in GT3.2Final –Identified issues with standards…

AHM September 2004 Where we are today! Information Integrator DB repository established and populated … with public data sets (OMIM, HUGO, RGD, SWISS-PROT) … linked to relevant resources (ENSEMBL- rat, human, mouse, MGI) GT3 based Grid services developed (BLAST) using own meta-scheduler General usage of ScotGrid and local Condor pool Portal developed using IBM WebSphere Genome visualisation browsers SyntenyVista – for viewing synteny between local/remote data sets MagnaVista – for exploring genetic information across multiple (remote) resources Gaining experience with security technologies Setting up policies with Grid security authorisation software etc Rolled-out Alpha version of system to CFG group July ‘04

AHM September 2004 Lessons learned Public data resources openness Often cannot query directly Often not easy/possible to find schemas Joint Data Standards Study investigating this  Started on 1 st June and involves –Digital Archiving Consultancy –Bioinformatics Research Centre (Glasgow) –NeSC (Edinburgh and Glasgow)  Look at technical, political, social, ethical etc issues involved in accessing and using public life science resources –Will liase with NDCC –Interview relevant scientists, data curators/providers  8 month project with final report in January –Funded by MRC, BBSRC, Wellcome Trust, JISC, NERC, DTI GT3 not without pain! (… understatement!!!!) Hopefully GT4 will be better?

AHM September 2004

AHM September 2004