Bioinformatics Computation in the Cloud A Joint Collaboration Between Microsoft’s External Research and eXtreme Computing Groups

Slides:



Advertisements
Similar presentations
Challenges Facing Enterprise IT REDUCED MANAGEMENT NEW ECONOMICS INCREASED OPPORTUNITIES.
Advertisements

Kensington Oracle Edition: Open Discovery Workflow Meets Oracle 10g Professor Yike Guo.
Cloud Computing for Research Roger S Barga Dennis Gannon, Jared Jackson, Wei Lu, Jaliya Ekanayake, Mohamed Fathalla Extreme Computing Group, MSR.
The design, construction and use of software tools to generate, store, annotate, access and analyse data and information relating to Molecular Biology.
Emmanuel Mesas Microsoft Western Europe Leverage Azure Services & Platform with Existing Application.
The Center for Computational Genomics and Bioinformatics Christopher Dwan Mike Karo Tim Kunau.
Linux Platform  Download the source tar ball from the BLAST source code link  ncbi-blast src.tar.gz  Compilation  cd /BLASTdirectory/c++ ./configure.
Microsoft's Windows Azure. Microsoft's Windows Azure Platform is a cloud platform offering, that "provides a wide range of Internet services that can.
Authors: Thilina Gunarathne, Tak-Lon Wu, Judy Qiu, Geoffrey Fox Publish: HPDC'10, June 20–25, 2010, Chicago, Illinois, USA ACM Speaker: Jia Bao Lin.
Overview Of Microsoft New Technology ENTER. Processing....
Unlock Your Data Rich connectivity Robust data integration Enterprise-class manageability Deliver Relevant Information Intuitive design environment.
Cloud Computing for Research Fabrizio Gagliardi Microsoft Research.
05 | Building 3D Visualizations using Power Map. Data & Analytics Self-service BI with the familiarity of Office and the power of the cloud.
Windows Azure for scalable compute and storage SQL Azure for relational storage for the cloud AppFabric infrastructure to connect the cloud.
Server Roles and Features.NET Framework 3.51.NET Framework 4.5 IIS Web Server IIS Default Document IIS Directory Browsing IIS HTTP Errors.
Parallel Data Analysis from Multicore to Cloudy Grids Indiana University Geoffrey Fox, Xiaohong Qiu, Scott Beason, Seung-Hee.
©2012 Microsoft Corporation. All rights reserved..
Microsoft Cloud Services Training and Certification Presented by Name Goes Here, Title.
SQL Server 2008 for Hosting Key Questions to Address How can SQL Server save your costs? How can SQL Server help you increase customer base? How can.
Lecture 8 – Platform as a Service. Introduction We have discussed the SPI model of Cloud Computing – IaaS – PaaS – SaaS.
> Utilize Windows Azure as integrated component of xRM solutions > Introduce new xRM capabilities in Dynamics CRM “5” > Demonstrate rapid development.
Getting Started with Windows Azure Name Title Microsoft Corporation.
MapReduce April 2012 Extract from various presentations: Sudarshan, Chungnam, Teradata Aster, …
Social Curation of Large Multimedia Collections on a Microsoft Azure Cloud Dazhi Chong, Samuel Coppage, Xiangyi Gu, Harris Wu Kurt Maly, Mohammad Zubair.
Your First Azure Application Michael Stiefel Reliable Software, Inc.
Introduction to Windows Azure BUGAEV ROMAN. Azure Windows Azure Platform is thus classified as platform as a service and forms part of Microsoft's cloud.
Service Computation 2010November 21-26, Lisbon.
CSIU Submission of BLAST jobs via the Galaxy Interface Rob Quick Open Science Grid – Operations Area Coordinator Indiana University.
Windows Azure Conference 2014 Designing Applications for Scalability.
PHP Features. Features Clean syntax. Object-oriented fundamentals. An extensible architecture that encourages innovation. Support for both current and.
The iPlant Collaborative Community Cyberinfrastructure for Life Science Tools and Services Workshop Discovery Environment Overview.
GeWorkbench John Watkinson Columbia University. geWorkbench The bioinformatics platform of the National Center for the Multi-scale Analysis of Genomic.
Bioinformatics Curriculum Issues, goals, curriculum.
Windows Azure. Azure Application platform for the public cloud. Windows Azure is an operating system You can: – build a web application that runs.
Engineering Excellence Trustworthy Computing Microsoft Confidential IT – A Key Tool in Addressing Environmental Challenges IT has an important role to.
Cloud Computing is a Nebulous Subject Or how I learned to love VDF on Amazon.
Cloud Computing Paradigms for Pleasingly Parallel Biomedical Applications Thilina Gunarathne, Tak-Lon Wu Judy Qiu, Geoffrey Fox School of Informatics,
GeWorkbench Overview Support Team Molecular Analysis Tools Knowledge Center Columbia University and The Broad Institute of MIT and Harvard.
What’s New in Windows Forms 2.0 Stephen Turner Software Design Engineer
DOE Network PI Meeting 2005 Runtime Data Management for Data-Intensive Scientific Applications Xiaosong Ma NC State University Joint Faculty: Oak Ridge.
Big Data Bioinformatics By: Khalifeh Al-Jadda. Is there any thing useful?!
__________________________________________________________________________________________________ Fall 2015GCBA 815 __________________________________________________________________________________________________.
User Scenarios in VENUS-C Focus on Structural Analysis Ignacio Blanquer I3M - UPV.
Windows Azure poDRw_Xi3Aw.
Executive Overview. Software modeling is essential, because it is the map that guides your developers. Additionally: Modeling Software  Visual information.
Enabling the Cloud OS Today  New high-density Web Sites with elastic cloud scaling and complete dev-ops experiences  New rich IaaS experience for self-service.
What is BLAST? Basic BLAST search What is BLAST?
CIP HPC CIP - HPC HPC = High Performance Computer It’s not a regular computer, it’s bigger, faster, more powerful, and more.
N. Jacq – Bio informatics Tests WP n° 1 WP6-WP7-WP10 Biology applications on testbed 0 Laboratoire de Biologie des.
Microsoft Public Cloud Services Automation Excellence Marcel Zehner | Cloud Believer Innovation itnetX
SQL Server 2012 Session: 1 Session: 4 SQL Azure Data Management Using Microsoft SQL Server.
(re)-Architecting cloud applications on the windows Azure platform CLAEYS Kurt Technology Solution Professional Microsoft EMEA.
 Cloud Computing technology basics Platform Evolution Advantages  Microsoft Windows Azure technology basics Windows Azure – A Lap around the platform.
Copyright c 2004 OSIsoft Inc. All rights reserved. Visualizing Performance Management Managing Information with RtPortal Gregg Le Blanc - OSIsoft Brian.
Introducing the Microsoft® .NET Framework
MATLAB Distributed, and Other Toolboxes
Platform as a Service.
Bioinformatics Madina Bazarova. What is Bioinformatics? Bioinformatics is marriage between biology and computer. It is the use of computers for the acquisition,
12/6/2018 © 2014 Microsoft Corporation. All rights reserved. Microsoft, Windows, and other product names are or may be registered trademarks and/or trademarks.
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
microsoft cloud platform: enterprise-class architecture
Cloud Helps Company Scale to Demand for Growing Healthcare Provider Field MINI-CASE STUDY “Microsoft Azure gives us the opportunity to focus on the task.
Technical Capabilities
Parallel System for BLAST
Quickly Set Up Connections and Integrations with No Upfront Investment, Using Azure Power MINI-CASE STUDY “We built and designed Integration Cloud, our.
Developing Windows Azure Applications with Visual Studio
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Cloud Computing for Research
Microsoft Azure Services Platform
Presentation transcript:

Bioinformatics Computation in the Cloud A Joint Collaboration Between Microsoft’s External Research and eXtreme Computing Groups

Microsoft Biology Foundation (MBF) An open-source library of reusable bioinformatics algorithms and functions built on the.NET platform.

Excel 2010 Familiar, Powerful, Extensible 64-bit optimizations for large data support Comfortable, rich client experience when working with large genomic sequences. MBF Extension to Ribbon

Basic Local Alignment Search Tool (BLAST) One of the most widely used bioinformatics algorithms for comparing the local similarity between biological sequences. Computationally intensive: Querying 360,000 sequences against a 3 GB protein database takes ~1000 CPU hours. Cloud

AzureBlast Parallel BLAST engine running in Windows Azure. Performance Improvement: Speedup of 45x with 50 roles and 94x with 100 roles. With 300 roles computation time is reduced to 4 hours Execution Blast Web Service Blast Web Service Splitter Worker Role Splitter Worker Role BLAST Execution Worker Role n BLAST Execution Worker Role n Combiner Worker Role Combiner Worker Role BLAST DB Configuration BLAST DB Configuration Genome DB Genome DB Azure Blob Storage … … BLAST Execution Worker Role 1 BLAST Execution Worker Role 1 Researcher

SilverMap Silverlight and WPF Component Visualization “heat map” of AzureBlast results Partnership with Queensland University of Technology (QUT)

Seamless Experience Evaluate data and invoke computational models from Excel. Computationally heavy analysis done close to large database of curated data. Scalable for large, surge computationally heavy analysis. Test local, run on the cloud. Visualize results locally using Silverlight or WPF applications. MBF + Excel Researcher

Research for the Masses Exploit the power of the cloud for research on a laptop. Remove reliance on dedicated, large compute center investments. Enable easier collaboration and novel simultaneous research concepts.