Data Placement and Task Scheduling in cloud, Online and Offline 2014.11.27 赵青 天津科技大学

Slides:



Advertisements
Similar presentations
February 20, Spatio-Temporal Bandwidth Reuse: A Centralized Scheduling Mechanism for Wireless Mesh Networks Mahbub Alam Prof. Choong Seon Hong.
Advertisements

School of Computing FACULTY OF ENGINEERING Grids and QoS Grid Computing has emerged in the last two decades, initially as a model for large-scale, resource-intensive.
Energy-efficient Task Scheduling in Heterogeneous Environment 2013/10/25.
University of Minnesota Optimizing MapReduce Provisioning in the Cloud Michael Cardosa, Aameek Singh†, Himabindu Pucha†, Abhishek Chandra
A Local-Optimization based Strategy for Cost-Effective Datasets Storage of Scientific Applications in the Cloud Many slides from authors’ presentation.
SLA-Oriented Resource Provisioning for Cloud Computing
Low Cost, Scalable Proteomics Data Analysis Using Amazon's Cloud Computing Services and Open Source Search Algorithms Brian D. Halligan, Ph.D. Medical.
Query Processing of Massive Trajectory Data based on MapReduce Qiang Ma, Bin Yang (Fudan University) Weining Qian, Aoying Zhou (ECNU) Presented By: Xin.
Green Cloud Computing Hadi Salimi Distributed Systems Lab, School of Computer Engineering, Iran University of Science and Technology,
The major IT companies, such as Microsoft, Google, Amazon, and IBM, pioneered the field of cloud computing and keep increasing their offerings in data.
Gueyoung Jung, Nathan Gnanasambandam, and Tridib Mukherjee International Conference on Cloud Computing 2012.
FI-WARE – Future Internet Core Platform FI-WARE Cloud Hosting July 2011 High-level description.
Present By : Bahar Fatholapour M.Sc. Student in Information Technology Mazandaran University of Science and Technology Supervisor:
Kick-off meeting 3 October 2012 Patras. Research Team B Communication Networks Laboratory (CNL), Computer Engineering & Informatics Department (CEID),
By- Jaideep Moses, Ravi Iyer , Ramesh Illikkal and
Energy Model for Multiprocess Applications Texas Tech University.
Telco Clouds: Modelling and Simulation
WORKFLOWS IN CLOUD COMPUTING. CLOUD COMPUTING  Delivering applications or services in on-demand environment  Hundreds of thousands of users / applications.
Measuring zSeries System Performance Dr. Chu J. Jong School of Information Technology Illinois State University 06/11/2012 Sponsored in part by Deer &
Abstract Cloud data center management is a key problem due to the numerous and heterogeneous strategies that can be applied, ranging from the VM placement.
Energy-aware Hierarchical Scheduling of Applications in Large Scale Data Centers Gaojin Wen, Jue Hong, Chengzhong Xu et al. Center for Cloud Computing,
Department of Computer Science Engineering SRM University
Dynamic Resource Allocation Using Virtual Machines for Cloud Computing Environment.
E-STAB: Energy-Efficient Scheduling for Cloud Computing Applications with Traffic Load Balancing Dzmitry KliazovichUniversity of Luxembourg, Luxembourg.
OPTIMAL SERVER PROVISIONING AND FREQUENCY ADJUSTMENT IN SERVER CLUSTERS Presented by: Xinying Zheng 09/13/ XINYING ZHENG, YU CAI MICHIGAN TECHNOLOGICAL.
Network Aware Resource Allocation in Distributed Clouds.
Cloud Computing Energy efficient cloud computing Keke Chen.
Cluster Reliability Project ISIS Vanderbilt University.
USTH Presentation Power-aware Scheduler for Virtualization TRAN Giang Son Prof. Daniel HAGIMONT Oct 19th, 2011.
An Autonomic Framework in Cloud Environment Jiedan Zhu Advisor: Prof. Gagan Agrawal.
Scalable Web Server on Heterogeneous Cluster CHEN Ge.
Challenges towards Elastic Power Management in Internet Data Center.
CloudNaaS: A Cloud Networking Platform for Enterprise Applications Theophilus Benson*, Aditya Akella*, Anees Shaikh +, Sambit Sahu + (*University of Wisconsin,
Papers on Storage Systems 1) Purlieus: Locality-aware Resource Allocation for MapReduce in a Cloud, SC ) Making Cloud Intermediate Data Fault-Tolerant,
1 Distributed Energy-Efficient Scheduling for Data-Intensive Applications with Deadline Constraints on Data Grids Cong Liu and Xiao Qin Auburn University.
Eneryg Efficiency for MapReduce Workloads: An Indepth Study Boliang Feng Renmin University of China Dec 19.
A Hierarchical MapReduce Framework Yuan Luo and Beth Plale School of Informatics and Computing, Indiana University Data To Insight Center, Indiana University.
© 2012 IBM Corporation Platform Computing 1 IBM Platform Cluster Manager Data Center Operating System April 2013.
GVis: Grid-enabled Interactive Visualization State Key Laboratory. of CAD&CG Zhejiang University, Hangzhou
1 Iterative Integer Programming Formulation for Robust Resource Allocation in Dynamic Real-Time Systems Sethavidh Gertphol and Viktor K. Prasanna University.
Energy-Aware Resource Adaptation in Tessellation OS 3. Space-time Partitioning and Two-level Scheduling David Chou, Gage Eads Par Lab, CS Division, UC.
Dzmitry Kliazovich University of Luxembourg, Luxembourg
A Fully Automated Fault- tolerant System for Distributed Video Processing and Off­site Replication George Kola, Tevfik Kosar and Miron Livny University.
CHAPTER 7 CLUSTERING SERVERS. CLUSTERING TYPES There are 2 types of clustering ; Server clusters Network Load Balancing (NLB) The difference between the.
Copyright © 2010, Performance and Power Management for Cloud Infrastructures Hien Nguyen Van; Tran, F.D.; Menaud, J.-M. Cloud Computing (CLOUD),
Operating System. Chapter 1: Introduction What is an Operating System? Mainframe Systems Desktop Systems Multiprocessor Systems Distributed Systems Clustered.
Dynamic Placement of Virtual Machines for Managing SLA Violations NORMAN BOBROFF, ANDRZEJ KOCHUT, KIRK BEATY SOME SLIDE CONTENT ADAPTED FROM ALEXANDER.
Ensieea Rizwani An energy-efficient management mechanism for large-scale server clusters By: Zhenghua Xue, Dong, Ma, Fan, Mei 1.
Resource Management Model of Data Storage Systems Oriented on Cloud Computing Elena Kaina Yury Korolev.
Cloud Computing – UNIT - II. VIRTUALIZATION Virtualization Hiding the reality The mantra of smart computing is to intelligently hide the reality Binary->
Introduction to: Tycoon A Market Based Resource Allocation System by Alejandro García López.
ECE 692 Power-Aware Computer Systems Final Review Prof. Xiaorui Wang.
Low Carbon Virtual Private Clouds Fereydoun Farrahi Moghaddam, Mohamed Cheriet, Kim Khoa Nguyen Synchromedia Laboratory Ecole de technologie superieure,
Meeting with University of Malta| CERN, May 18, 2015 | Predrag Buncic ALICE Computing in Run 2+ P. Buncic 1.
1 Automated Power Management Through Virtualization Anne Holler, VMware Anil Kapur, VMware.
A Hierarchical Edge Cloud Architecture for Mobile Computing IEEE INFOCOM 2016 Liang Tong, Yong Li and Wei Gao University of Tennessee – Knoxville 1.
GreenCloud: A Packet-level Simulator of Energy-aware Cloud Computing Data Centers Dzmitry Kliazovich, Pascal Bouvry, Yury Audzevich, and Samee Ullah Khan.
Reinforcement Learning Based Virtual Cluster Management
Astronomical Data Processing & Workflow Scheduling in cloud
Lesson Objectives Aims Key Words
Virtual laboratories in cloud infrastructure of educational institutions Evgeniy Pluzhnik, Evgeniy Nikulchev, Moscow Technological Institute
Computing Resource Allocation and Scheduling in A Data Center
Adaptive Cloud Computing Based Services for Mobile Users
Hui Chen, Shinan Wang and Weisong Shi Wayne State University
Digital Processing Platform
Zhen Xiao, Qi Chen, and Haipeng Luo May 2013
Resource-Efficient and QoS-Aware Cluster Management
Progress Report 2017/02/08.
Gordon McKenna | MVP & CEO | Inframon
Presentation transcript:

Data Placement and Task Scheduling in cloud, Online and Offline 赵青 天津科技大学

Motivation ●Increase the corresponding speed and throughput ●Guarantee QoS ●Energy Efficient and Green Computing

Overview ●Data placement for data-intensive application ●Task scheduling for QoS and energy efficiency ●Online task scheduling

1. Data Placement for data-intensive application ●Data clustering based on data correlation if put every 2 at different nodes? how much data transfer amount would be increased BEA Hierarchical clustering tree Objective: Place the close-related data items together so as to decrease data transfers Contributions: 1.Introduced data size factors 2.Issued “First Order Conduction Correlation” from intermediate data

●Data distribution Storage capacity, computation load balance “Tree-to-Tree” greedy allocation strategy Modified PSO algorithm 1. Data Placement for data-intensive application ●Cloud platform modeling Physical network structure/ BEA Objective: Make the frequent data movements happene on high-speed channels so as to improve network utilization and the efficiency of the whole cloud system.

1. Data Placement for data-intensive application ●Runtime data placement —Newly generated datasets will be saved to the data center which has the maximum dependency with it —The cost of re-distribution itself will also be taken into account. ●Results: by the greedy allocation strategies

2. Task Scheduling and Virtual Machine Allocation ●Objective: —Distribute the tasks with strong data dependences to the servers on a high-bandwidth connection, and turn off some of these servers with low utilization —Therefore: the response time can be reduced the utilization of system wide can be improved some idle network devices can also be turned off ●Task Clustering by —Hypergraph partitioning —BEA Transformation Efficient & Energy Saving!

2. Task Scheduling and Virtual Machine Allocation ●Requirement of tasks —Storage requirement —Computing Resource requirement: represented by VMs. ●Task Scheduling and Deadline constraint: Decrease the number of VMs as much as possible, while ensuring users’ Service Level Agreements.

2. Task Scheduling and Virtual Machine Allocation ●Physical machine allocation —Optimization objective: energy efficiency, high-bandwidth networks, load balance —Greedy Strategy: Each server’s energy efficiency TRD (Task Requirement Degree) Top-Down & Bottom-up: reduce data transfers, and improve network utilization Load balance —Constraint conditions: storage capacity, CPU and memory constraints —Other Methods: Genetic algorithms, PSO algorithms Optimal utilization level in terms of performance-per-watt: Commonly,

3. Online Scheduling ●Problems: —How to schedule the tasks in a fine-grained workflow? —How to deal with some variable conditions at runtime? ●Reinforcement learning based methods The goalof RL is to find the optimal policy parameter AgentEnvironment State s Action a Reward r

3. Online Scheduling Example: Cart-Pole Swing-up ●Task: swing up the pole by moving the cart ●State (2-D continuous): angle, and velocity of the pole ●Action (1-D continuous): force applied to cart ●Reward:

Thank for your time !