Jason Stowe Condor Week 2009 April 22 nd, 2009. Coming to Condor Week since 2005. Started as a User.

Slides:



Advertisements
Similar presentations
Grid Computing at The Hartford OGF22 February 27, 2008 Robert Nordlund
Advertisements

University of Notre Dame
GPU Computing with Hartford Condor Week 2012 Bob Nordlund.
An Approach to Secure Cloud Computing Architectures By Y. Serge Joseph FAU security Group February 24th, 2011.
INTRODUCTION TO CLOUD COMPUTING CS 595 LECTURE 6 2/13/2015.
Nikolay Tomitov Technical Trainer SoftAcad.bg.  What are Amazon Web services (AWS) ?  What’s cool when developing with AWS ?  Architecture of AWS 
Undergraduate Poster Presentation Match 31, 2015 Department of CSE, BUET, Dhaka, Bangladesh Wireless Sensor Network Integretion With Cloud Computing H.M.A.
Barracuda Networks Confidential1 Barracuda Backup Service Integrated Local & Offsite Data Backup.
Slide 1 of 9 Presenting 24x7 Scheduler The art of computer automation Press PageDown key or click to advance.
An Introduction to Cloud Computing. The challenge Add new services for your users quickly and cost effectively.
Take An Internal Look at Hadoop Hairong Kuang Grid Team, Yahoo! Inc
Building Data-intensive Pipelines Ravi K Madduri Argonne National Lab University of Chicago.
CLOUD COMPUTING & COST MANAGEMENT S. Gurubalasubramaniyan, MSc IT, MTech Presented by.
A Brief Overview by Aditya Dutt March 18 th ’ Aditya Inc.
PROOF: the Parallel ROOT Facility Scheduling and Load-balancing ACAT 2007 Jan Iwaszkiewicz ¹ ² Gerardo Ganis ¹ Fons Rademakers ¹ ¹ CERN PH/SFT ² University.
Boston Bootcamp April 27 th, 2013 Azure Websites Udaiappa Ramachandran ( Udai
 Cloud computing  Workflow  Workflow lifecycle  Workflow design  Workflow tools : xcp, eucalyptus, open nebula.
VS Anywhere. Visual Studio Industry Partner VS Anywhere NEXT STEPS Contact us at: Websitehttps://vsanywhere.com Blog- Facebook.
Connecting OurGrid & GridSAM A Short Overview. Content Goals OurGrid: architecture overview OurGrid: short overview GridSAM: short overview GridSAM: example.
Cloud Computing. What is Cloud Computing? Cloud computing is a model for enabling convenient, on-demand network access to a shared pool of configurable.
Cloud Computing 1. Outline  Introduction  Evolution  Cloud architecture  Map reduce operation  Platform 2.
PCGRID ‘08 Workshop, Miami, FL April 18, 2008 Preston Smith Implementing an Industrial-Strength Academic Cyberinfrastructure at Purdue University.
W HAT IS H ADOOP ? Hadoop is an open-source software framework for storing and processing big data in a distributed fashion on large clusters of commodity.
Cloud Computing & Amazon Web Services – EC2 Arpita Patel Software Engineer.
Ian Alderman A Little History…
INFSO-RI Module 01 ETICS Overview Alberto Di Meglio.
Grid Workload Management & Condor Massimo Sgaravatto INFN Padova.
An Introduction to Progress Arcade ™ June 12, 2013 Rob Straight Senior Manager, OpenEdge Product Management.
INFSO-RI Module 01 ETICS Overview Etics Online Tutorial Marian ŻUREK Baltic Grid II Summer School Vilnius, 2-3 July 2009.
Thinking Outside the Nest Utilizing Enterprise Resources with Condor Bob Nordlund The Hartford Condor Week 2006.
Condor In Flight at The Hartford 2006 Transformations Condor Week 2007 Bob Nordlund.
Grid Computing at The Hartford Condor Week 2008 Robert Nordlund
How AWS Pricing Works Jinesh Varia Technology Evangelist.
Tonido Cloud Private, Highly Scalable, Self-Hosted Cloud Storage/Sync Solution.
Derek Wright Computer Sciences Department University of Wisconsin-Madison MPI Scheduling in Condor: An.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
BOINC: Progress and Plans David P. Anderson Space Sciences Lab University of California, Berkeley BOINC:FAST August 2013.
11 CLUSTERING AND AVAILABILITY Chapter 11. Chapter 11: CLUSTERING AND AVAILABILITY2 OVERVIEW  Describe the clustering capabilities of Microsoft Windows.
== Enovatio Delivers a Scalable Project Management Solution Minus Large Upfront Infrastructure Costs, Thanks to the Powerful Microsoft Azure Platform MICROSOFT.
Enterprise Cloud Computing
1 e-Science AHM st Aug – 3 rd Sept 2004 Nottingham Distributed Storage management using SRB on UK National Grid Service Manandhar A, Haines K,
Introduction to The Storage Resource.
Stairway to the cloud or can we take the highway? Taivo Liik.
Virtualization Technology and Microsoft Virtual PC 2007 YOU ARE WELCOME By : Osama Tamimi.
Doug Benjamin Duke University. 2 ESD/AOD, D 1 PD, D 2 PD - POOL based D 3 PD - flat ntuple Contents defined by physics group(s) - made in official production.
Landing in the Right Nest: New Negotiation Features for Enterprise Environments Jason Stowe.
Information Systems in Organizations 5.2 Cloud Computing.
Experiments in Utility Computing: Hadoop and Condor Sameer Paranjpye Y! Web Search.
Tackling I/O Issues 1 David Race 16 March 2010.
Built on the Microsoft Azure Platform, UberCloud Helps Engineers and Software Providers to Offer and Deploy Powerful Cloud Services On Demand MICROSOFT.
Open J2EE Session Cluster December Open J2EE Session Cluster 2 Agenda Drivers Existing products Description Technology choices Demo Plan.
Windows Azure Backup Speaker Name Date Agenda Data protection challenges Introducing Windows Azure Backup How it works Security When to choose Windows.
 Cloud Computing technology basics Platform Evolution Advantages  Microsoft Windows Azure technology basics Windows Azure – A Lap around the platform.
INTRODUCTION TO GRID & CLOUD COMPUTING U. Jhashuva 1 Asst. Professor Dept. of CSE.
Apache Hadoop on Windows Azure Avkash Chauhan
© 2015 MetricStream, Inc. All Rights Reserved. AWS server provisioning © 2015 MetricStream, Inc. All Rights Reserved. By, Srikanth K & Rohit.
1© Copyright 2016 EMC Corporation. All rights reserved.1 SCALEIO WITH CLOUDLINK.
HPC In The Cloud Case Study: Proteomics Workflow
Univa Grid Engine Makes Work Management Automatic and Efficient, Accelerates Deployment of Cloud Services with Power of Microsoft Azure MICROSOFT AZURE.
MASS Java Documentation, Verification, and Testing
HPC In The Cloud Case Study: Proteomics Workflow
AWS Integration in Distributed Computing
The Improvement of PaaS Platform ZENG Shu-Qing, Xu Jie-Bin 2010 First International Conference on Networking and Distributed Computing SQUARE.
Hadoop Clusters Tess Fulkerson.
Using the Parallel Universe beyond MPI
CIS 332 Education for Service-- tutorialrank.com
AWS Cloud Computing Masaki.
Saranya Sriram Developer Evangelist | Microsoft
Cloud Computing: Concepts
Client/Server Computing and Web Technologies
Presentation transcript:

Jason Stowe Condor Week 2009 April 22 nd, 2009

Coming to Condor Week since Started as a User

Users hunger for features

AccountingGroups (2004/2005) Configuration w/Pipes (2005/2006) GroupResourcesUsed (2006/2007) Condor in Cloud (2007/2008) Resource Weights (2008/2009) Based upon customer requests

Focus on software development for managing Condor at any scale, and provide services that complement the technology

Universities, Fortune 500s, Government Labs, Small/Medium Businesses, that use Condor

Users like Condor because... It’s open, it works, flexible, (corporations) no lock-in API/Operating System, and...

The Community

Today, let’s talk about a few challenges, solutions

War Story #1: Compute & Data

Whenever you find or solve a computation problem, you discover a data problem.

“Dark” or Latent, Unused Storage on any OS/Device

Empty space dispersed across machines in unusable sizes

“We need more filer space, but we have empty space on all our machines.”

So we looked at Hadoop

New type of storage: Aggregated or “Cloud” Storage

Block Store Architecture

But how do we use it?

1.5 years ago: It works well to access it in Java, but what about mounting?

So we tried WebDAV

Next up, open source FUSE driver

Need: Windows/Linux, Reliable, Large Files, scalable, and Read/Write

Mountable drivers Linux(FUSE) / Windows (IFS)

CloudFS Architecture

When we rolled it out...

Customers Asked for Surprising Features HTTP/REST Protocols similar to Amazon S3 HTTP/REST Protocols similar to Amazon S3Reasons: Installing mountable driver across servers/workstations prohibitive Want similar interface to various cloud storage providers => Internal Cloud FTP Interface – Because it is simple! FTP Interface – Because it is simple!

Status Today

Mountable Multi-platform Drivers. Linux: SUSE 10, RHEL/CentOS 4&5, Windows 2k3 +, OSX 10.3+

Encryption to avoid snooping sensitive data

Data Nodes built on Java: Linux, Windows, OSX, Solaris

RESTful Storage Service & FTP interface

Management interface for controlling storage features (Integrating with CycleServer)

Looking forward to condor_hadoop!

War Story #2: Cloud Calculations

Condor users Peak vs. Median usage Problem

Need for compute power comes up suddenly

Condor Users hunger for resources

Condor users balance “We need more servers for big runs” and “Our servers are 40% utilized”

Many ways to solve this problem using EC2

Use cases do exist for adding nodes to a local condor pool using Amazon EC2

We favored entire pools in cloud

Data Scheduling, Performance issues

Run workflows faster using resources you could never buy...

can test CycleServer at a scale our users have and we don’t

Need 1000 node Condor Pool Wait 15 minutes

Dynamic Resources => Pool can be sized to the jobs Dynamic Resources => Pool can be sized to the jobs

1 corex 1000 hrs = 1000 core x 1 hr = ~$200

Sounds good, but how do we do this for a Workflow like BLAST?

From e-science 2008: For 64x the processors Hadoop Running Blast: 57x mpiBLAST: 52.4x

High-CPU Amazon EC2 nodes have best price/performance

Scalability: 2x CPUs = x 64 CPUS = 60.7x Speed-up

Why High Throughput leads to Efficient Computing

Another User: Worked with Varian - Mass Spectrometers Other High-Tech Lab Equipment

Problem: Coming up on a conference, needed to run a large simulation

Six Weeks On an internal Condor pool

Deployed a Condor pool in CycleCloud

Same 6-week Job

Ran < 1 Day

War Story #3: Management

Condor Tutorial mentions “Why use a personal Condor?” i.e. Condor on few nodes...

Condor on 1 computer Gets you policies, fault-tolerance, Etc.

Similarly, management issues come up even on small pools

Collaborating with U. of W. Madison

Managing Configuration Files (our Config with Pipes CW2006)

Exploring ClassAds/LogFiles becomes problematic

Visualization, Reporting, etc.

Man-decades on development of tools to assist running Condor

Have demo against Madison pool Come see me. We’d love more use cases

Questions? Thank you For more information go to: We constantly see opportunities for talented Condor folks, so please feel free to contact us! Jason Stowe jstowe - cyclecomputing.com