its potential for Grid and Cloud Computing Tan Tin Wee Dept of Biochemistry Yong Loo Lin School of Medicine National University of Singapore
Merging two trajectories LiveCD/DVD/USB/VM eScience and Grid Computing APBioGrid : Globus Grid 1.1.6/7 – Ong Guan Sin (2002) APBioBox 1 and 2 with Sun’s BioClusterGrid/BioBox and KOOPLite workflow integration APBioKnoppix 1 and 2 LiveCD - Derek Kiong, Justin Choo ( ) BioSlax LiveCD - Mark de Silva and Lim Kuan Siong ( ) BioSlax LiveDVD, LiveUSB, VMplayer ( ) BioSlax Server ( ) BioSlax7.5-VMplayer-MokaFive (2009) UDBlast-BioSlax-VMplayer-MokaFive (2009) SINGAREN (1998) Telemanufacturing, eScience, remote control of Synchrotron beam line ( ) NUS BioGrid and APBioGrid ( ) National Grid and LSVGC database ( ) BioManufacturing Grid (SERC Proof of Concept for Integrated Manufacturing and Services Systems (IMSS) grant) ( ) - KOOPrime’s KOOPlatform - SIMTech and NTU - APSTC Sun Microsystems TeraCampus Grid ( ) 1 st World Wide Workflow Grid - Chris Baker (2008)
NUS BioGrid and APBioGrid
APBioBox
National Grid Office and LS VGC
APBioKnoppix ( ) Bioinformatics in a LiveCD Used by 500 NUS students a year in Life Science Module LSM2104 from 2004 to 2007
Integrated Biolaboratory Manufacturing and Services System A TSRP pilot project Group 9 Project Integrated Manufacturing and Services Systems (IMSS) For acceleration of specific areas of manufacturing, in particular biomanufacturing. www2.jp.apan.net/meetings/bangkok2005/presentation/tantinwee-APAN19-eScienceBKKJan05.ppt
1st WWWFG
TeraCampusGrid
Grid Apps on Grid svu/services/gridapps.htm
UDBlast at
Merged two trajectories LiveCD/DVD/USB/VM eScience and Grid Computing APBioGrid : Globus Grid 1.1.6/7 – Ong Guan Sin (2002) APBioBox 1 and 2 with Sun’s BioClusterGrid/BioBox and KOOPLite workflow integration APBioKnoppix 1 and 2 LiveCD - Derek Kiong, Justin Choo ( ) BioSlax LiveCD - Mark de Silva and Lim Kuan Siong ( ) BioSlax LiveDVD, LiveUSB, VMplayer ( ) BioSlax Server ( ) BioSlax7.5-VMplayer-MokaFive (2009) UDBlast-BioSlax-VMplayer-MokaFive (2009) SINGAREN (1998) Telemanufacturing, eScience, remote control of Synchrotron beam line ( ) NUS BioGrid and APBioGrid ( ) National Grid and LSVGC database ( ) BioManufacturing Grid (SERC Proof of Concept for Integrated Manufacturing and Services Systems (IMSS) grant) ( ) - KOOPrime’s KOOPlatform - SIMTech and NTU - APSTC Sun Microsystems TeraCampus Grid ( ) 1 st World Wide Workflow Grid - Chris Baker (2008) Moka5+VMWare+BioSlax On
( )
BioSlax with VMplayer Bioinformatics software in a virtual machine Packages Applications Programming languages – C, perl, python etc GUI interfaces Servers – apache, mysql No need to reboot, just install VMplayer, run BioSlax vmx file and create a new instance of a BioSlax virtual machine
Rapid and Easy Customisable Interface
Addable Modules Modules e.g. newmodule.lzm Stored in /mnt/sda1/modules Activated during boot up Compile –> dir2lzm -> copy into virtual HDD /mnt/sda1/module
Persistent Changes changes=/mnt/sda1/changes To change the boot up sequence for starting any program Add /etc/rc.d/rc.newprog
BioSlax 7.5 Bootable with changes LiveCD, LiveDVD, LiveUSB BioSlax-VMplayer version MokaFive version portable on Intel Machines What has it/can it be used for? Desktop-as-a-Service UDGrid mpagent ftp://sf01.bic.nus.edu.sg/incoming/bioslax/fixes/zz01_gr id_mpagent.lzm
BioSlax with VMplayer+MokaFive Downloadable from MokaFive server Multiple instantiations Shoppable Cacheable LivePC No need for installation Portable on a USB Rapid deployment – user initiated
BioSlax+VM+MokaFive: Desktop-as-a-Service
Desktop-as-a-Service
Grid-enabling BioSlax: UD MP agent in BioSlax History – UDBlast and UDHMMer Rapid recruitment of machines No need for user installation Push or pull instances Enduser joining of Grid
Joining a virtual machine to
UD-BioSlax-VMware-Moka5 How Desktops-as-a-Service can be used for harnessing free cpu cycles quickly End-users and members of the public can contribute “Contribute-to-Benefit” model – enduser can log in to monitor their “credits” depending on how much CPU time they have contributed and how much they pay. Creation of public participation grid computing and public recruitment of grid-enabled processors without interfering with Enduser’s machine. Inter-Grid functionality – deploy instances of UD- BioSlax on multiple clouds
1.UD Mpagent updates require reinstallation. 2.Variation of the scheduler requires reinstallation 3.Endusers have to install the mpagent. 4.MPagent not portable 1.Moka5 can do push the update Alternatively, BioSlax can do PXE boot 2.Endusers can run the agent whenever they want to 3.MPagent wrapped in BioSlax, played in Vmplayer, pushed by Moka5 is portable
Future Work BioSlax LivePC pushed to clouds End users can have BioSlax as a development platform that will work exactly as the BioSlax pushed to the cloud. Nutch-Hadoop configured on BioSlax and pushed onto Amazon EC2, for example Bioinformatics datasets will be permanently pushed to EC2 as a common accessible database by data providers for end-users to compute