Bioinformatics Community of CNGrid A New Approach to Utilizing Grids

Slides:



Advertisements
Similar presentations
Designing Services for Grid-based Knowledge Discovery A. Congiusta, A. Pugliese, Domenico Talia, P. Trunfio DEIS University of Calabria ITALY
Advertisements

What’s New: Windows Server 2012 R2 Tim Vander Kooi Systems Architect
1 Week #1 Objectives Review clients, servers, and Windows network models Differentiate among the editions of Server 2008 Discuss the new Windows Server.
Cloud Computing projects in Engineering Harold Castro, Ph.D. Associate professor Systems and Computing Engineering COMIT (Communications and IT) Research.
SQL Server 2008 for Hosting Key Questions to Address How can SQL Server save your costs? How can SQL Server help you increase customer base? How can.
IPlant Collaborative Tools and Services Workshop iPlant Collaborative Tools and Services Workshop Overview of Atmosphere.
Understanding and Managing WebSphere V5
VAP What is a Virtual Application ? A virtual application is an application that has been optimized to run on virtual infrastructure. The application software.
Scientific Data Infrastructure in CAS Dr. Jianhui Scientific Data Center Computer Network Information Center Chinese Academy of Sciences.
Cloud Computing for the Enterprise November 18th, This work is licensed under a Creative Commons.
By Mihir Joshi Nikhil Dixit Limaye Pallavi Bhide Payal Godse.
Lecture 8 – Platform as a Service. Introduction We have discussed the SPI model of Cloud Computing – IaaS – PaaS – SaaS.
Customized cloud platform for computing on your terms !
Introduction to Cloud Computing
material assembled from the web pages at
IPlant Collaborative Tools and Services Workshop iPlant Collaborative Tools and Services Workshop Collaborating with iPlant.
608D CloudStack 3.0 Omer Palo Readiness Specialist, WW Tech Support Readiness May 8, 2012.
IPlant Collaborative Tools and Services Workshop iPlant Collaborative Tools and Services Workshop Collaborating with iPlant.
The Future of the iPlant Cyberinfrastructure: Coming Attractions.
Breaking Barriers Exploding with Possibility Breaking Barriers Exploding with Possibility The Cloud Era Unveiled.
DuraCloud Open technologies and services for managing durable data in the cloud Michele Kimpton, CBO DuraSpace.
20409A 7: Installing and Configuring System Center 2012 R2 Virtual Machine Manager Module 7 Installing and Configuring System Center 2012 R2 Virtual.
Windows Azure poDRw_Xi3Aw.
Integration of BioInformatics tools at NUS. GenBank Growth Chart Year Bases.
Planning Server Deployments Chapter 1. Server Deployment When planning a server deployment for a large enterprise network, the operating system edition.
WP5 – Infrastructure Operations Test and Production Infrastructures StratusLab kick-off meeting June 2010, Orsay, France GRNET.
IPlant Collaborative Tools and Services Workshop iPlant Collaborative Tools and Services Workshop Overview of Atmosphere.
Prof. Jong-Moon Chung’s Lecture Notes at Yonsei University
SharePoint 101 – An Overview of SharePoint 2010, 2013 and Office 365
Accessing the VI-SEEM infrastructure
Nithyamoorthy S Core Mind Technologies
Avenues International Inc.
Chapter 1: Introduction
Introduction to Distributed Platforms
Tools and Services Workshop
Web Application.
Customized cloud platform for computing on your terms !
Joslynn Lee – Data Science Educator
op5 Monitor - Scalable Monitoring
Prepared by: Assistant prof. Aslamzai
CaRT eCapacity Initiative Ghana Productivity Apps
StratusLab Final Periodic Review
StratusLab Final Periodic Review
Chapter 1: Introduction
Platform as a Service.
Tools and Services Workshop Overview of Atmosphere
Cloud Management Mechanisms
The Client/Server Database Environment
The Future? Or the Past and Present?
VceTests VCE Test Dumps
Study course: “Computing clusters, grids and clouds” Andrey Y. Shevel
NGS Oracle Service.
Holy Quran Application
Chapter 1: Introduction
Web Based Application Cloud services, in the form of centralized web-based applications, also appeal to the IT professional. One instance of an application.
Microsoft Braindumps with Verified Question Answers
Chapter 4.
Business Process Management Software
20409A 7: Installing and Configuring System Center 2012 R2 Virtual Machine Manager Module 7 Installing and Configuring System Center 2012 R2 Virtual.
Virtualization Techniques
Knowledge Based Workflow Building Architecture
Cloud Management Mechanisms
Collaborative Business Solutions
Module 01 ETICS Overview ETICS Online Tutorials
Cloud computing mechanisms
Language Processors Application Domain – ideas concerning the behavior of a software. Execution Domain – Ideas implemented in Computer System. Semantic.
Explore Evolution: Instrument for Analysis
Storing and Accessing G-OnRamp’s Assembly Hubs outside of Galaxy
Client/Server Computing and Web Technologies
Containers and DevOps.
Presentation transcript:

Bioinformatics Community of CNGrid A New Approach to Utilizing Grids Yongwei Wu Tsinghua University wuyw@tsinghua.edu.cn Participants Sponsorship Beijing Institute of Genomics, CAS Tsinghua University

Outline Background Our Approach Achievements Concluding Remarks

Exponential Growth of Bio Data We need more storage and processing power to store and analyze these data!

Grids Draw Much Attention Bioinformatics is an important application domain of grid computing around the world!

Problems with Existing Bioinformatics Grids Practice Professionals and sharing are not well balanced Resources are limited in the environments built under the leadership of domain scientists. The scale is also limited in the environments built only by the Bioinformatics researchers. For those environments built based on general infrastructure, they are usually not professional, hard to use No support for sharing GUI software Not highlighting data’s support for computation Data synchronization, backup and storage are beyond the ability of domain users, whereas IT developers know little about application requirements. Covering only partial research activities No support of daily communication, results sharing, …

Outline Background Our Approach Achievements Concluding Remarks

Key Points Domain scientists lead the bioinformatics community development Develops Nova to support GUI software sharing Nova is a toolkit for customizing app environment Highlights data support for computation Storage can be attached to the computing environment Introduces new functionalities Knowledge repository, data/software sharing, Q&A system

Nova: A Virtual Computing Toolkit Nova aims to provide facilities for users to utilize physical infrastructures in an easier and more productive way. Customized Host Customized Cluster Customized Services Nova

Nova Architecture & Work Procedure Master Node Worker Nodes Information Service Configuration ② Query ③ Create VM ① Request Worker Selection Data Storage ④ OS Image VM ⑤ Start VM ⑦ Notification ⑧ VNC Remote Desktop Data Storage KVM/XEN Hypervisor ⑥ App Image VM Monitor

Nova Features Install-/configuration-free client users only need a Web browser to use the system High productivity pre-virtualized software and one-click configuration Inherent integration with storage cloud After VMs are created, personal space in storage cloud can be automatically attached as an independent driver, which then acts as a source for input data and as the destination of produced data

GUI Software Sharing by Nova User Request Nova Core Services Worker Selection Image Loading …

Providing Workflow Support To improve research efficiency further Both workflow definition tool and workflow services are supplied.

Knowledge Repository Hot Research Topics Important Journals Important Conferences/Workshops Famous Scholars Important Research Institutes Influential Surveys/Papers Important Organizations/Associations

Other Useful Resources Q & A System For users to help each other System Announcement Seminar Research Breakthrough Conference/Workshop CFP Newly added functions

Outline Background Our Approach Achievements Concluding Remarks

The Community is on Service 16

Sequence Format Conversion (17) 234 tools are integrated! DNA Analysis (28) (SequenceViewer, WinGene, etc.) RNA Analysis (19) (miRanda, RNAshapes, etc.) Protein Analysis (17) (InterproScan, InterViewer, etc.) Protein Structure (38) (Protein Explorer, RasTop, etc.) 234 Evolution Analysis (41) (MEGA, GeneTree, etc.) Sequence Assemble and Alignment(74) (BLAST, ClustalX, BioEdit, etc.) Sequence Format Conversion (17) (SeqVerter, DataConvert, etc.)

47 databases are provided! UCSC Genome full mirror + following

Community Usage More than 100 users now More than 60 institutes are involved. Users’ preference to the resources provided Software tools 77.38% Database 67.86% Knowledge Repository and others 32.14% Software Tool Database Knowledge Repository

Scenario for HnNn Analysis 20

Outline Background Our Approach Achievements Concluding Remarks

Concluding Remarks User- and task-oriented design is important The key to driving cloud computing successful The reason why we choose domain scientist-leading design Value-added services are important The key to attracting and retaining users The reason why we provide workflow support Challenges still ahead How to survive the data deluge? How to support new requirements?

Thanks!