MagnaData: Scheduling Complex Workflows with Non-Functional Requirements in Datacenters Laurens Versluis Massivizing Computer Systems @Large research.

Slides:



Advertisements
Similar presentations
Manage your technology for optimal return on investment (ROI) The Tivoli ® Configuration & Operations management solution from IBM.
Advertisements

1/17/20141 Leveraging Cloudbursting To Drive Down IT Costs Eric Burgener Senior Vice President, Product Marketing March 9, 2010.
Grids for Complex Problem Solving, 29 January 2003 Grid based collaborative working in large distributed organisations
Technology from seed Cloud-TM: A distributed transactional memory platform for the Cloud Paolo Romano INESC ID Lisbon, Portugal 1st Plenary EuroTM Meeting,
Achieving Elasticity for Cloud MapReduce Jobs Khaled Salah IEEE CloudNet 2013 – San Francisco November 13, 2013.
SLA-Oriented Resource Provisioning for Cloud Computing
Xavier León PhD defense
Energy Saving Software based on Cloud Computing for Adjustable Processing Environments (ESSCCAPE) The Green Cloud.
New Challenges in Cloud Datacenter Monitoring and Management
1 Efficient Management of Data Center Resources for Massively Multiplayer Online Games V. Nae, A. Iosup, S. Podlipnig, R. Prodan, D. Epema, T. Fahringer,
Scheduling a Large DataCenter Cliff Stein Columbia University Google Research June, 2009 Monika Henzinger, Ana Radovanovic Google Research.
COST IC804 – IC805 Joint meeting, February Jorge G. Barbosa, Altino M. Sampaio, Hamid Harabnejad Universidade do Porto, Faculdade de Engenharia,
Euro-Par 2007, Rennes, 29th August 1 The Characteristics and Performance of Groups of Jobs in Grids Alexandru Iosup, Mathieu Jan *, Ozan Sonmez and Dick.
A Survey of Mobile Cloud Computing Application Models
Dynamic Resource Allocation Using Virtual Machines for Cloud Computing Environment.
1 Distributed Process Scheduling: A System Performance Model Vijay Jain CSc 8320, Spring 2007.
1 Time & Cost Sensitive Data-Intensive Computing on Hybrid Clouds Tekin Bicer David ChiuGagan Agrawal Department of Compute Science and Engineering The.
GRID’2012 Dubna July 19, 2012 Dependable Job-flow Dispatching and Scheduling in Virtual Organizations of Distributed Computing Environments Victor Toporkov.
© DATAMAT S.p.A. – Giuseppe Avellino, Stefano Beco, Barbara Cantalupo, Andrea Cavallini A Semantic Workflow Authoring Tool for Programming Grids.
The Owner Share scheduler for a distributed system 2009 International Conference on Parallel Processing Workshops Reporter: 李長霖.
Automated Control in Cloud Computing: Challenges and Opportunities Harold C. Lim, Shivnath Babu, Jeffrey S. Chase, and Sujay S. Parekh ACM’s First Workshop.
Combining the strengths of UMIST and The Victoria University of Manchester Adaptive Workflow Processing and Execution in Pegasus Kevin Lee School of Computer.
10 th December, 2013 Lab Meeting Papers Reviewed:.
VO-Ganglia Grid Simulator Catalin Dumitrescu, Mike Wilde, Ian Foster Computer Science Department The University of Chicago.
Copyright © 2008 Accenture All Rights Reserved.Copyright © 2008 Accenture All Rights Reserved.Copyright © 2008 Accenture All Rights Reserved.Copyright.
Static Process Scheduling
Accounting for Load Variation in Energy-Efficient Data Centers
Xi He Golisano College of Computing and Information Sciences Rochester Institute of Technology Rochester, NY THERMAL-AWARE RESOURCE.
Optimize the Business with Microsoft Datacenter Services 2.0
1 TCS Confidential. 2 Objective : In this session we will be able to learn:  What is Cloud Computing?  Characteristics  Cloud Flavors  Cloud Deployment.
Enabling Dynamic Memory Management Support for MTC on NVIDIA GPUs Proposed Work This work aims to enable efficient dynamic memory management on NVIDIA.
CMS Experience with the Common Analysis Framework I. Fisk & M. Girone Experience in CMS with the Common Analysis Framework Ian Fisk & Maria Girone 1.
Intelligent Agents: Technology and Applications Unit Five: Collaboration and Task Allocation IST 597B Spring 2003 John Yen.
1 Performance Impact of Resource Provisioning on Workflows Gurmeet Singh, Carl Kesselman and Ewa Deelman Information Science Institute University of Southern.
The Post Windows Operating System
UNLEASH YOUR FULL POTENTIAL
SuperComputing 2003 “The Great Academia / Industry Grid Debate” ?
Measurement-based Design
Object-Oriented Analysis and Design
The Open Grid Service Architecture (OGSA) Standard for Grid Computing
PLC Question 1: What do we expect all students to learn?
Introduction | Model | Solution | Evaluation
architecting the DIGITAL enterprise
PINPOINTBPS® PROPOSAL INNOVATING TOWARDS CERTAINTY
Amity University, Noida, India
Abstract Major Cloud computing companies have started to integrate frameworks for parallel data processing in their product portfolio, making it easy for.
AWS Batch Overview A highly-efficient, dynamically-scaled, batch computing service May 2017.
Server Allocation for Multiplayer Cloud Gaming
Agenda Welcome and overview (Peter)
Cloud Computing.
Let’s Discuss Cloud Services in India Are we going to talk about these beautiful clouds? Great.
Cloud Computing B. Ramamurthy 9/19/2018 B. Ramamurthy.
PASHTEK.COM.  Pashtek is an experienced salesforce consulting company in arizona focused on Salesforce solutions.  Pashtek have a strong team of experienced.
PASHTEK.COM.  Pashtek is an experienced salesforce consulting company in arizona focused on Salesforce solutions.  Pashtek have a strong team of experienced.
Pertemuan 22 Materi : Buku Wajib & Sumber Materi :
Vlad Nae, Radu Prodan, Thomas Fahringer Institute of Computer Science
Modernizing your enterprise with hybrid it
Proposal for Term Project Operating Systems, Fall 2018
Automating Profitable Growth™
GRUBER: A Grid Resource Usage SLA Broker
The Most In-Demand Skills for Cloud Computing.
Resource and Service Management on the Grid
The Performance of Big Data Workloads in Cloud Datacenters
Overview of Workflows: Why Use Them?
Competitive Industry Report and Calculations
Mihai Neacşu, BSc. Prof.dr.eng. Alexandru Iosup Ir. Laurens Versluis
Basics of Cloud Computing
1 Envision 3 Outline 4 Design
Welcome to (HT)Condor Week #19 (year 34 of our project)
Towards a frictionless social security
Presentation transcript:

MagnaData: Scheduling Complex Workflows with Non-Functional Requirements in Datacenters Laurens Versluis Massivizing Computer Systems @Large research https://atlarge-research.com

Massivizing Computer Systems Big Science Education for Everyone (Online) Business Services Online Gaming Grid Computing Datacenters Daily Life Massivizing Computer Systems focuses on large systems that impact society, ranging from education to datacenters and from Big Science to Online gaming. In this talk, we will mainly focus on this components <click> Cloud computing. (next slide… Because cloud popularity and its usage….)

Cloud popularity and usage at all-time high Surveys: 86% of companies use >1 cloud service, >$200B market by 2020 Efficient resource utilization increasingly important Reduce costs for client & provider Competitive position for companies Source: http://business.nasdaq.com/marketinsite/2017/Cloud-Computing-Industry-Report-and-Investment-Case.html

Workflow execution is popular Workflows = set of tasks with precedence constraints Usually represented as a Directed Acyclic Graph (DAG) Used to model applications in many domains Today: thousands of applications in use

Executing workflows in the cloud Workflows are submitted to the cloud, executed in datacenters Workflow resource demand changes over time due to their complex structures Non-trivial to efficiently schedule incoming workloads of workflows and allocate enough resources How many resources to acquire and when? Workload of workflows

The MagnaData Project: Overview

RM&S for Complex, Dynamic Data-Services Goal: The first comprehensive RM&S for cloud datacenters with per-component changes of performance and availability requirements. Four important concepts: Allocation & Provisioning policies How to efficiently schedule tasks? When to allocate a new machine? Etc. Fine granularity (task-based) Non-functional requirements How to specify NFRs? How to enforce them? What is the minimal specification required to enforce? Dynamic changes CCGrid’18: Compare eight provisioning policies (autoscalers) HotCloudPerf’18: Investigated three workflow formalisms on their NFR support

CCGrid’18: Compared 8 autoscalers in simulation Four distinct workload traces A workload is a set of workflows (applications) Use a rich set of metrics 10+ forms of elasticity Four experiments Different workload domains (new) Bursty workloads (deeper understanding) Impact of the allocation policy (new) Different resource environments (new)

HotCloudPerf’18: Formalisms for workflows with non-functional requirements Surveyed 116 papers from across 11 venues Investigated three most popular workflow formalisms DAG BPMN Petri net Main findings: No formalism is capable of specifying arbitrary non-functional requirements at the task level DAGs look the most promising to extend

Roadmap

Conclusion and ongoing work Main take-away message: The MagnaData project focuses on improving efficiency of scheduling in clouds. My work focuses on: Introducing task-based NFR Devise new allocation & provisioning policies Valorisation: collaborate with third parties Interested in this work? Let me know! The performance differs significantly per application domain All autoscalers perform similar on bursty workloads in terms of NSL The allocation policy has a direct impact on performance Suggests to co-design allocation and provision policies Some autoscalers overprovision more while yielding no better NSL Laurens Versluis Massivizing Computer Systems @Large research, https://atlarge-research.com

Cloud architecture overview

The current approach