XML-Based Grid Data System for Bioinformatics Development Noppadon Khiripet, Ph.D Wasinee Rungsarityotin, MS Chularat Tanprasert, Ph.D Royol Chitradon.

Slides:



Advertisements
Similar presentations
Bioinformatics Platform Three-tier Architecture Object-based Relational Database implemented using Oracle Middleware implemented using Entity-Class Operations,
Advertisements

DATE: 2008/03/11 NCHC-Grid Computing Portal (NCHC-GCE Portal) Project Manager: Dr. Weicheng Huang Developed Team: Chien-Lin Eric Huang Chien-Heng Gary.
Architectural Constraints on Current Bioinformatics Integration Systems Norman Paton Department of Computer Science University of Manchester Manchester,
CVRG Presenter Disclosure Information Tahsin Kurc, PhD Center for Comprehensive Informatics Emory University CardioVascular Research Grid Core Infrastructure.
The Anatomy of the Grid: An Integrated View of Grid Architecture Carl Kesselman USC/Information Sciences Institute Ian Foster, Steve Tuecke Argonne National.
XML/EDI Overview West Chester Electronic Commerce Resource Center (ECRC)
High Performance Computing Course Notes Grid Computing.
Grid & Libraries, 10/18/04.1 Second Invitational Berkeley – Academia Sinica Grid Digital Libraries Workshop, Taipei, October 18, 2004 Grid Middleware Application.
1 Introduction to XML. XML eXtensible implies that users define tag content Markup implies it is a coded document Language implies it is a metalanguage.
Introduction to Web services MSc on Bioinformatics for Health Sciences May 2006 Arnaud Kerhornou Iván Párraga García INB.
Network Management Overview IACT 918 July 2004 Gene Awyzio SITACS University of Wollongong.
NYU Microarray Database (NYUMAD)
CoMPAS Pro: Comprehensive Meta Prediction and Annotation Services for Proteins Sebastian J. Schultheiß Christoph Malisi.
DCS Architecture Bob Krzaczek. Key Design Requirement Distilled from the DCS Mission statement and the results of the Conceptual Design Review (June 1999):
2006 IEEE International Conference on Web Services ICWS 2006 Overview.
Workshop on Cyber Infrastructure in Combustion Science April 19-20, 2006 Subrata Bhattacharjee and Christopher Paolini Mechanical.
Milos Kobliha Alejandro Cimadevilla Luis de Alba Parallel Computing Seminar GROUP 12.
Grids and Grid Technologies for Wide-Area Distributed Computing Mark Baker, Rajkumar Buyya and Domenico Laforenza.
The Open Grid Service Architecture (OGSA) Standard for Grid Computing Prepared by: Haoliang Robin Yu.
Software – Part 3 V.T. Raja, Ph.D., Information Management College of Business Oregon State University.
Introduction to Grid Computing Ann Chervenak Carl Kesselman And the members of the Globus Team.
Web-based Portal for Discovery, Retrieval and Visualization of Earth Science Datasets in Grid Environment Zhenping (Jane) Liu.
Web Services Michael Smith Alex Feldman. What is a Web Service? A Web service is a message-oriented software system designed to support inter-operable.
Principles for Collaboration Systems Geoffrey Fox Community Grids Laboratory Indiana University Bloomington IN 47404
Genome database & information system for Daphnia Don Gilbert, October 2002 Talk doc at
Grid Toolkits Globus, Condor, BOINC, Xgrid Young Suk Moon.
Grappa: Grid access portal for physics applications Shava Smallen Extreme! Computing Laboratory Department of Physics Indiana University.
DISTRIBUTED COMPUTING
CoG Kit Overview Gregor von Laszewski Keith Jackson.
Fundamentals of Database Chapter 7 Database Technologies.
1 School of Computer, National University of Defense Technology A Profile on the Grid Data Engine (GridDaEn) Xiao Nong
9.351 Systems Analysis & DesignDistributed Systems & User Interface1 Distributed Systems Distributed system = IS that contains a network component and.
Doug Raiford Lesson 3.  More and more sequence data is being generated every day  Useless if not made available to other researchers.
SPREAD TOOLKIT High performance messaging middleware Presented by Sayantam Dey Vipin Mehta.
1 A National Virtual Specimen Database for Early Cancer Detection June 26, 2003 Daniel Crichton NASA Jet Propulsion Laboratory Sean Kelly NASA Jet Propulsion.
IPlant cyberifrastructure to support ecological modeling Presented at the Species Distribution Modeling Group at the American Museum of Natural History.
Web Services for Satellite Emulation Development Kathy J. LiszkaAllen P. Holtz The University of AkronNASA Glenn Research Center.
The Anatomy of the Grid: An Integrated View of Grid Architecture Ian Foster, Steve Tuecke Argonne National Laboratory The University of Chicago Carl Kesselman.
Extending Access To Information Resource Discovery Service William E. Moen, Ph.D. Kathleen R. Murray, Ph.D. School of Library and Information Sciences.
Commodity Grid Kits Gregor von Laszewski (ANL), Keith Jackson (LBL) Many state-of-the-art scientific applications, such as climate modeling, astrophysics,
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
GRID ARCHITECTURE Chintan O.Patel. CS 551 Fall 2002 Workshop 1 Software Architectures 2 What is Grid ? "...a flexible, secure, coordinated resource- sharing.
LEGS: A WSRF Service to Estimate Latency between Arbitrary Hosts on the Internet R.Vijayprasanth 1, R. Kavithaa 2,3 and Raj Kettimuthu 2,3 1 Coimbatore.
What is SAM-Grid? Job Handling Data Handling Monitoring and Information.
Genomes to Grids Thoughts on Building Data Grids for Biology Biologists have discovered many millions of genes and genome features, now part of the bio-data.
GRID Overview Internet2 Member Meeting Spring 2003 Sandra Redman Information Technology and Systems Center and Information Technology Research Center National.
Presented by Jens Schwidder Tara D. Gibson James D. Myers Computing & Computational Sciences Directorate Oak Ridge National Laboratory Scientific Annotation.
Cooperative experiments in VL-e: from scientific workflows to knowledge sharing Z.Zhao (1) V. Guevara( 1) A. Wibisono(1) A. Belloum(1) M. Bubak(1,2) B.
Symphony A Java-Based Composition and Manipulation Framework for Computational Grids Dennis Kafura Markus Lorch This work is supported by the Virginia.
Introduction to Grids By: Fetahi Z. Wuhib [CSD2004-Team19]
Replica Management Kelly Clynes. Agenda Grid Computing Globus Toolkit What is Replica Management Replica Management in Globus Replica Management Catalog.
Introduction to Web Services. Agenda Motivation History Web service model Web service components A walkthrough examples.
Development of e-Science Application Portal on GAP WeiLong Ueng Academia Sinica Grid Computing
Applications of the Globus Toolkit Butterfly Grid ( Applications of the Globus Toolkit Butterfly Grid (
GRID ANATOMY Advanced Computing Concepts – Dr. Emmanuel Pilli.
Globus: A Report. Introduction What is Globus? Need for Globus. Goal of Globus Approach used by Globus: –Develop High level tools and basic technologies.
Partnerships in Innovation: Serving a Networked Nation Grid Technologies: Foundations for Preservation Environments Portals for managing user interactions.
Virtual Lab AMsterdam VLAMsterdam Abstract Machine Toolbox A.S.Z. Belloum, Z.W. Hendrikse, E.C. Kaletas, H. Afsarmanesh and L.O. Hertzberger Computer Architecture.
The Earth Information Exchange. Portal Structure Portal Functions/Capabilities Portal Content ESIP Portal and Geospatial One-Stop ESIP Portal and NOAA.
PARALLEL AND DISTRIBUTED PROGRAMMING MODELS U. Jhashuva 1 Asst. Prof Dept. of CSE om.
A System for Monitoring and Management of Computational Grids Warren Smith Computer Sciences Corporation NASA Ames Research Center.
G. Russo, D. Del Prete, S. Pardi Kick Off Meeting - Isola d'Elba, 2011 May 29th–June 01th A proposal for distributed computing monitoring for SuperB G.
XML and Distributed Applications By Quddus Chong Presentation for CS551 – Fall 2001.
Grid Services for Digital Archive Tao-Sheng Chen Academia Sinica Computing Centre
WEB-BASED APPLICATION for DIGITAL PATHOLOGY and MOLECULAR ANALYSIS Dave Billiter, BA, PMP, Tom Barr, BS, Mark Plaskow, BA, MCSD, Kathy Nicol, MD Research.
Towards a High Performance Extensible Grid Architecture Klaus Krauter Muthucumaru Maheswaran {krauter,
The Open Grid Service Architecture (OGSA) Standard for Grid Computing
Collaborations and Interactions with other Projects
Cloud-Enabling Technology
Presentation transcript:

XML-Based Grid Data System for Bioinformatics Development Noppadon Khiripet, Ph.D Wasinee Rungsarityotin, MS Chularat Tanprasert, Ph.D Royol Chitradon Ph.D National Electronics and Computer Technology Center, Thailand

APBioNet Resources BioDatabases - BioMirrors, PDB Mirror, SRS BioComputing –Beowulf Linux cluster, BioNavigator, Vector NTI, BioTraining –S* Life Science Informatics Alliance, Hypercourse in Online Bioinformatics Etc.

APBioNet BioDatabases

What’s missing ? I have a bioinformatics question. How do I search the answer ?

Sample Question What proteins in rice have families with a size greater than twenty members with at least one known structure, whose corresponding gene expression is activated under dry conditions, and that are involved in interactions with at least two other proteins ?

Query Mechanism Meta Server 1.Find genes in rice 2.Apply genes to protein DB 3.Check with known structures 4.Link to Gene expression DB 5.Link to 2-hybrid data 6.Link to functional annotation data QueriesResults What’s the communication protocol ?

eXtensible Markup Language (XML) Standard language of data exchange Very flexible for defining complex data structures Many supported tools such as XSL, DOM, Perl, and JAVA Less overhead when transform data from one to the other formats

XML-Based Grid Data System for Bioinformatics protein dry 2+

Grid Data System Grid technologies such as Globus enables sharing of geographically distributed content Create a virtual resource of biological data Our interest overlaps with the Commodity Grid Project –Exporting Grid technologies to our applications

Motivating Example: Alliance Science Portal, CoG Kit What we have learned about Commodity Grid: –Access and communicate with a variety of information sources –Ability to include remote computational resources –Performance guarantees –Portable user interfaces

Basic Integrated Grid Architecture Protocols, Authentication, Policy, Resource Management, Discovery, Events, etc. Protocols, Authentication, Policy, Resource Management, Discovery, Events, etc. Gene finding Gene finding Protein Prediction Protein Prediction Grid Services (Middleware) Grid Fabric (Resources) Applications Toolkits Data Grid Remote Computation Portals … … Storage Networks, Computer, Display Devices, etc. and associated local services

How To Use Grid? Commodity Grid Toolkits Low-level Grid Technologies Web Interface Genbank, EMBL, BLASTDB, etc.. Remote Data analysis service Local Data analysis service Applications & Toolkits Grid Services & Fabric

CoG Mapping to Grid fabric and services

Our Application: Rice* Building a genomic framework for research in rice –Providing information and computational resources *In collaboration with DNA Technology Laboratory, Kasetsart University and Computational Biology Research Group, University of Washington

Our focus on Grid technology Low-level utility components: –Build Data cache/replication as an application toolkit in the integrated Grid architecture Application-specific GUI components –A query interface for the genomic framework for rice research on the top layer (application) of Grid architecture

Questions ?