Biosciences Working Group Update
Wilfred W. Li, Ph.D., UCSD, USA
Habibah Wahab, Ph.D., USM, Malaysia
Hosted by CNIC, CAS, Beijing, China, Oct 16-18, 2013

Presentation transcript:

Cloud-based solutions for Biosciences
– Data as a Service (DaaS): Data Sets, Public Records, Biological Databases
– Software as a Service (SaaS): Computational Pipelines, Specialized Tools
– Platform as a Service (PaaS): Execution Platforms, Programming Environments
– Infrastructure as a Service (IaaS): Virtual Machines, Virtual Clusters
– Network as a Service (NaaS): Network Resources, Programmable Networks

Rocks BioApp
– Utilizing Rocks Cluster Distribution
– IaaS: Virtual Machines
– SaaS: Utilizing the Opal toolkit
– Applications: AutoDock, AutoDock Vina, PDB2PQR, MEME, …
– PRAGMA Cloud: Using Gfarm for VM storage and sharing
– Contact: Nadya Williams
2013 Google Science Fair Grand Prize Winner: Eric Chen, a high school student in the UCSD BioChemCore program. New antivirals against influenza endonuclease using virtual screening tools.

CNIC – Duckling Collaboration Library
CLB (Collaboration Library)
– A component of Duckling, an open-source toolkit developed by CNIC, Chinese Academy of Sciences (CAS)
– Used by all Duckling applications as the Data Repository
– Extended to support data cloud service (CLB+)
Architecture diagram labels: Duckling Portal; Resource; Document Collaboration Tool (DCT); Collaboration Library (CLB); User Management Tool (UMT); Virtual Organization Tool (VMT); Duckling Application Integration Framework; Application Plug-ins
Dong et al., IEEE e-Science 2013, in press.

UCSD – Private Cloud Data Service
– UCSD Research Cyberinfrastructure Program
– Campus-funded initiative to support big data applications
– Interviewed 50 groups on campus
Li et al., IEEE e-Science 2013, in press.

Mashup of Typical Research Data Flows
– Many research groups follow similar data flows
– Each group routinely uses a subset of the components
– The number of storage/replica nodes, computing nodes and instruments varies among the groups, but the data flow and usage patterns are quite similar
Diagram components: Instruments; Laptop/Desktop; Network Attached Storage Nodes; Replicated Storage Nodes; Compute nodes; High Performance File System; Cloud storage nodes; On Campus External Servers
Diagram connections: NFS; NFS/CIFS; FTP; Replicate; Lustre; Archive; Share

Update from Konkuk University
Prepared a proposal for a government grant
– Development of novel technologies for studying metagenomics based on cloud computing (Institutes: CBRU and BDRC at Konkuk University, SDSC and Calit2 at UCSD)
Workshop proposal for PRAGMA26
– Theme: NGS, Metagenomics, HPC, Clouds and Collaboration; CFP out early next year.
Jaebum Kim, Ph.D.

Update from Konkuk University
Plan for an international consortium
– Time: Jan (tentative)
– Place: Konkuk University, Korea
– Topic: Environment- and toxicity-related microorganisms and bioinformatics
– Institutes: UW-METC (Dr. Yu), Konkuk University, and more (tentative)
– More information will be out soon
– If you are interested, let us know and we can invite you

Active Folder: Integrating All Activities of Simulation on File System
Congratulations, Dr. Daeyoung Heo!
– He received his PhD last summer
Two posters and a demo in the WG
– Active Folder: Integrating All Activities of Simulation on File System - NAS Version
– Predicting and Forecasting System of Urban Ecology on Meteorological Changing
Active Folder: good for comparative case studies
– Tasks
  ● Described as regular folders and files
– Product
  ● Input or output of a simulation
  ● Can be handled like a regular file using legacy software
  ● Contains provenance information (metadata, task info, etc.)
  ● Can be reproduced by the task, which is extracted from the provenance information
– Resource
  ● A computing server (local, grid, cloud, or anything else) is registered as regular folders and files
  ● To submit a job (task), just drag and drop the task folder onto the folder that represents the computing server
Daeyoung Heo, Suntae Hwang
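The folder-based workflow on this slide can be illustrated with a small sketch. This is not the Active Folder implementation; it only shows the general idea under stated assumptions: a task is a folder containing a hypothetical `task.json` file, a computing server is represented by an ordinary folder, submitting means moving the task folder into it, and a hypothetical `provenance.json` file is written beside the product. The file names and the functions `submit` and `process_pending` are invented for illustration.

```python
import json
import shutil
import subprocess
import sys
import tempfile
from pathlib import Path

TASK_FILE = "task.json"              # hypothetical: describes the command to run
PROVENANCE_FILE = "provenance.json"  # hypothetical: written next to the product


def submit(task_dir: Path, server_dir: Path) -> Path:
    """Submit a task by moving its folder into the server folder (the 'drag and drop')."""
    return Path(shutil.move(str(task_dir), str(server_dir / task_dir.name)))


def process_pending(server_dir: Path) -> list:
    """Run every task folder in server_dir that has no provenance record yet."""
    done = []
    for task_dir in sorted(p for p in server_dir.iterdir() if p.is_dir()):
        task_path = task_dir / TASK_FILE
        if not task_path.exists() or (task_dir / PROVENANCE_FILE).exists():
            continue  # not a task folder, or already processed
        task = json.loads(task_path.read_text())
        result = subprocess.run(
            task["command"], cwd=task_dir, capture_output=True, text=True
        )
        # The product is an ordinary file, usable by any legacy tool.
        (task_dir / task["output"]).write_text(result.stdout)
        # Store the task with the product so the run can be repeated later.
        provenance = {"task": task, "returncode": result.returncode}
        (task_dir / PROVENANCE_FILE).write_text(json.dumps(provenance, indent=2))
        done.append(task_dir)
    return done


if __name__ == "__main__":
    root = Path(tempfile.mkdtemp())
    server = root / "compute-server"
    server.mkdir()
    task = root / "hello-task"
    task.mkdir()
    (task / TASK_FILE).write_text(json.dumps({
        "command": [sys.executable, "-c", "print('simulated result')"],
        "output": "result.txt",
    }))
    submit(task, server)
    for finished in process_pending(server):
        print(finished.name, (finished / "result.txt").read_text().strip())
```

Because the provenance record in this sketch embeds the full task description, a product could be reproduced by copying that description back into a fresh task folder and submitting it again.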

Active Folder: Integrating All Activities of Simulation on File System
⊙ Active Folder on DropBox+EC2
– Cost & performance problem with very large files
⊙ Active Folder on NAS
⊙ NAS (Network Attached Storage)
● Large volume storage
● Network file system (NFS, SMB/CIFS, AFP, …)
● Most vendors support cloud solutions like DropBox
Slide labels: at PRAGMA 25; at PRAGMA 24

Case Study: Volcano Eruption Simulation using Active Folder

Breakout Sessions
Presentations (Today, 2:40 – 4:15 pm, Rm 514)
– Kevin Dong, CNIC
– Jaebum Kim, Konkuk University
– Wilfred Li, UCSD
– Daeyoung Heo, Kookmin University
– Others, please let me know.
Planning (Tomorrow, 11:10 am – 12:30 pm)
Joint Sessions (Tomorrow, 3:50 – 4:30 pm, Resources and Data, Cyber Learning)
Conferences
– IEEE e-Science 2013, 10/23-25/2013 (National Convention Center, Beijing)