1 On-line Parallel Tomography Shava Smallen UCSD.

Slides:

Advertisements

Similar presentations

Pricing for Utility-driven Resource Management and Allocation in Clusters Chee Shin Yeo and Rajkumar Buyya Grid Computing and Distributed Systems (GRIDS)

Advertisements

Feedback Control Real-Time Scheduling: Framework, Modeling, and Algorithms Chenyang Lu, John A. Stankovic, Gang Tao, Sang H. Son Presented by Josh Carl.

Load Balancing Parallel Applications on Heterogeneous Platforms.

Hadi Goudarzi and Massoud Pedram

Scheduling in Distributed Systems Gurmeet Singh CS 599 Lecture.

SLA-Oriented Resource Provisioning for Cloud Computing

LOAD BALANCING IN A CENTRALIZED DISTRIBUTED SYSTEM BY ANILA JAGANNATHAM ELENA HARRIS.

CprE 458/558: Real-Time Systems (G. Manimaran)1 CprE 458/558: Real-Time Systems Dynamic Planning Based Scheduling.

Resource Management of Highly Configurable Tasks April 26, 2004 Jeffery P. HansenSourav Ghosh Raj RajkumarJohn P. Lehoczky Carnegie Mellon University.

SKELETON BASED PERFORMANCE PREDICTION ON SHARED NETWORKS Sukhdeep Sodhi Microsoft Corp Jaspal Subhlok University of Houston.

Kuang-Hao Liu et al Presented by Xin Che 11/18/09.

1 Virtual Machine Resource Monitoring and Networking of Virtual Machines Ananth I. Sundararaj Department of Computer Science Northwestern University July.

The Organic Grid: Self- Organizing Computation on a Peer-to-Peer Network Presented by : Xuan Lin.

GridFlow: Workflow Management for Grid Computing Kavita Shinde.

A Grid Resource Broker Supporting Advance Reservations and Benchmark- Based Resource Selection Erik Elmroth and Johan Tordsson Reporter ： S.Y.Chen.

CSE 160/Berman Programming Paradigms and Algorithms W+A 3.1, 3.2, p. 178, 6.3.2, H. Casanova, A. Legrand, Z. Zaogordnov, and F. Berman, "Heuristics.

MCell Usage Scenario Project #7 CSE 260 UCSD Nadya Williams

Fault-tolerant Adaptive Divisible Load Scheduling Xuan Lin, Sumanth J. V. Acknowledge: a few slides of DLT are from Thomas Robertazzi ’ s presentation.

Silberschatz, Galvin and Gagne  Operating System Concepts Chapter 6: CPU Scheduling Basic Concepts Scheduling Criteria Scheduling Algorithms.

Security-Driven Heuristics and A Fast Genetic Algorithm for Trusted Grid Job Scheduling Shanshan Song, Ricky Kwok, and Kai Hwang University of Southern.

Present by Chen, Ting-Wei Adaptive Task Checkpointing and Replication: Toward Efficient Fault-Tolerant Grids Maria Chtepen, Filip H.A. Claeys, Bart Dhoedt,

Differentiated Multimedia Web Services Using Quality Aware Transcoding S. Chandra, C.Schlatter Ellis and A.Vahdat InfoCom 2000, IEEE Journal on Selected.

Bandwidth Allocation in a Self-Managing Multimedia File Server Vijay Sundaram and Prashant Shenoy Department of Computer Science University of Massachusetts.

On-Demand Media Streaming Over the Internet Mohamed M. Hefeeda, Bharat K. Bhargava Presented by Sam Distributed Computing Systems, FTDCS Proceedings.

MobSched: An Optimizable Scheduler for Mobile Cloud Computing S. SindiaS. GaoB. Black A.LimV. D. AgrawalP. Agrawal Auburn University, Auburn, AL 45 th.

Silberschatz, Galvin and Gagne  Operating System Concepts Chapter 6: CPU Scheduling Basic Concepts Scheduling Criteria Scheduling Algorithms.

OPERATING SYSTEMS CPU SCHEDULING.  Introduction to CPU scheduling Introduction to CPU scheduling  Dispatcher Dispatcher  Terms used in CPU scheduling.

Predicting performance of applications and infrastructures Tania Lorido 27th May 2011.

OPTIMAL SERVER PROVISIONING AND FREQUENCY ADJUSTMENT IN SERVER CLUSTERS Presented by: Xinying Zheng 09/13/ XINYING ZHENG, YU CAI MICHIGAN TECHNOLOGICAL.

Network Aware Resource Allocation in Distributed Clouds.

Parallel Tomography Shava Smallen CSE Dept. U.C. San Diego.

An Autonomic Framework in Cloud Environment Jiedan Zhu Advisor: Prof. Gagan Agrawal.

ROBUST RESOURCE ALLOCATION OF DAGS IN A HETEROGENEOUS MULTI-CORE SYSTEM Luis Diego Briceño, Jay Smith, H. J. Siegel, Anthony A. Maciejewski, Paul Maxwell,

Young Suk Moon Chair: Dr. Hans-Peter Bischof Reader: Dr. Gregor von Laszewski Observer: Dr. Minseok Kwon 1.

Sogang University Advanced Computing System Chap 1. Computer Architecture Hyuk-Jun Lee, PhD Dept. of Computer Science and Engineering Sogang University.

Performance Model & Tools Summary Hung-Hsun Su UPC Group, HCS lab 2/5/2004.

Jean-Sébastien Gay LIP ENS Lyon, Université Claude Bernard Lyon 1 INRIA Rhône-Alpes GRAAL Research Team Join work with DIET TEAM D istributed I nteractive.

A Survey of Distributed Task Schedulers Kei Takahashi (M1)

Scientific Workflow Scheduling in Computational Grids Report: Wei-Cheng Lee 8th Grid Computing Conference IEEE 2007 – Planning, Reservation,

Euro-Par, A Resource Allocation Approach for Supporting Time-Critical Applications in Grid Environments Qian Zhu and Gagan Agrawal Department of.

Stochastic DAG Scheduling using Monte Carlo Approach Heterogeneous Computing Workshop (at IPDPS) 2012 Extended version: Elsevier JPDC (accepted July 2013,

CPU Scheduling Gursharan Singh Tatla 1-Feb-20111www.eazynotes.com.

1 Grid Scheduling Cécile Germain-Renaud. 2 Scheduling Job –A computation to run on a machine –Possibly with network access e.g. input/output file (coarse.

Resource Mapping and Scheduling for Heterogeneous Network Processor Systems Liang Yang, Tushar Gohad, Pavel Ghosh, Devesh Sinha, Arunabha Sen and Andrea.

Zibin Zheng DR 2 : Dynamic Request Routing for Tolerating Latency Variability in Cloud Applications CLOUD 2013 Jieming Zhu, Zibin.

1 11/29/2015 Chapter 6: CPU Scheduling l Basic Concepts l Scheduling Criteria l Scheduling Algorithms l Multiple-Processor Scheduling l Real-Time Scheduling.

Real-Time Support for Mobile Robotics K. Ramamritham (+ Li Huan, Prashant Shenoy, Rod Grupen)

1 Iterative Integer Programming Formulation for Robust Resource Allocation in Dynamic Real-Time Systems Sethavidh Gertphol and Viktor K. Prasanna University.

Rassul Ayani 1 Performance of parallel and distributed systems  What is the purpose of measurement?  To evaluate a system (or an architecture)  To compare.

MROrder: Flexible Job Ordering Optimization for Online MapReduce Workloads School of Computer Engineering Nanyang Technological University 30 th Aug 2013.

Energy-Aware Resource Adaptation in Tessellation OS 3. Space-time Partitioning and Two-level Scheduling David Chou, Gage Eads Par Lab, CS Division, UC.

Content caching and scheduling in wireless networks with elastic and inelastic traffic Group-VI 09CS CS CS30020 Performance Modelling in Computer.

Multimedia Computing and Networking Jan Reduced Energy Decoding of MPEG Streams Malena Mesarina, HP Labs/UCLA CS Dept Yoshio Turner, HP Labs.

Introduction to Real-Time Systems

Parameter Sweep and Resources Scaling Automation in Scalarm Data Farming Platform J. Liput, M. Paciorek, M. Wrona, M. Orzechowski, R. Slota, and J. Kitowski.

Data Consolidation: A Task Scheduling and Data Migration Technique for Grid Networks Author: P. Kokkinos, K. Christodoulopoulos, A. Kretsis, and E. Varvarigos.

Sunpyo Hong, Hyesoon Kim

Parallel Tomography Shava Smallen SC99. Shava Smallen SC99AppLeS/NWS-UCSD/UTK What are the Computational Challenges? l Quick turnaround time u Resource.

A stochastic scheduling algorithm for precedence constrained tasks on Grid Future Generation Computer Systems (2011) Xiaoyong Tang, Kenli Li, Guiping Liao,

Assess usability of a Web site’s information architecture: Approximate people’s information-seeking behavior (Monte Carlo simulation) Output quantitative.

1 A Grid-Based Middleware’s Support for Processing Distributed Data Streams Liang Chen Advisor: Gagan Agrawal Computer Science & Engineering.

Adaptive Online Scheduling in Storm Paper by Leonardo Aniello, Roberto Baldoni, and Leonardo Querzoni Presentation by Keshav Santhanam.

1 Performance Impact of Resource Provisioning on Workflows Gurmeet Singh, Carl Kesselman and Ewa Deelman Information Science Institute University of Southern.

Memory Management.

Introduction | Model | Solution | Evaluation

A Framework for Automatic Resource and Accuracy Management in A Cloud Environment Smita Vijayakumar.

CPU SCHEDULING.

Networked Real-Time Systems: Routing and Scheduling

Replica Placement Heuristics of Application-level Multicast

Resource Allocation for Distributed Streaming Applications

Presentation transcript:

1 On-line Parallel Tomography Shava Smallen UCSD

2 I) Introduction to On-line Parallel Tomography II) Tunable On-line Parallel Tomography III) User-directed application-level scheduler IV) Experiments V) Conclusion Talk Outline

3 What is tomography? A method for reconstructing the interior of an object from its projections At the National Center for Microscopy and Imaging Research (NCMIR), tomography is applied to electron microscopy to study specimens at the cellular and subcellular level

4 Tomogram of spiny dendrite (Images courtesy of Steve Lamont) Example

5 Parallel Tomography at NCMIR Embarrassingly parallel X Y slice specimen Z scanline projection scanline

6 NCMIR Usage Scenarios Off-line parallel tomography (off-line PT) –Data resides somewhere on secondary storage –Single, high quality tomogram –Reduce turnaround time –Previous work (HCW’ 00) On-line parallel tomography (on-line PT) –Data streamed from the electron microscope long makespan, configuration errors, etc. –Iteratively computed tomogram –Soft real-time execution

7 On-line PT Real-time feedback on quality of data acquisition 1) First projection acquired from microscope 2) Generate coarse tomogram 3) Iteratively refine tomogram using subsequent projections (refresh) Update each voxel value Size of tomogram is constant

8 NCMIR Target Platform Multi-user, heterogenous resources –NCMIR cluster SGI Indigo2, SGI Octane, SUN ULTRA, SUN Enterprise IRIX, Solaris –Meteor cluster Pentium III dual proc Linux, PBS –Blue Horizon AIX, Loadleveler, Maui Scheduler network

slices preprocessor ptomo writer On-line PT Architecture projection scanlines tomogram

10 On-line PT Design 1) Frame on-line parallel tomography as a tunable application –Resource limitations / dynamic –Availability of alternate configurations [Chang,et al] each configuration corresponds to different output quality and resource usage 2) Coupled with user-directed application- level scheduler (AppLeS) –adaptive scheduler –promote application performance

11 On-line PT Configuration Triple: (f, r, su) Reduction factor (f) –Reduce resolution of data  reduce both computation and communication Projections per refresh (r) –Reduce refinement frequency  reduce communication Service Units - (su) –Increase cost of execution  increase computational power

12 User Preferences Best configuration (f, r, su) = (1, 1, 0 ) Several possible configurations  user specifies bounds –projections should be at least size 256x256 1  f  4 or 1  f  8 –user could tolerate up to a 10 minute time wait 1  r  13 –reasonable upper bound 0  su  (50 x acquisition period x c)

13 User-directed Feasible? –Use dynamic load information –if work allocation found Better? –e.g. 1. (1, 6, 4) - best f 2. (2, 2, 8) - good su/r 3. (2, 1, 20) - best r reduction factor projections per refresh service units

generate request display triples adjust request review triples process request find work allocation execute on-line PT accepts one rejects all infeasible feasible User-directed AppLeS User User-directed AppLeS

15 Triple Search Search parameter space –If triple satisfies constraints  feasible Constrained optimization problem based on soft real-time execution –compute constraint –transfer constraint Heuristics to reduce search space – e.g. assume user will always choose (1,2,1) over (1,2,4)

16 Work Allocation work allocation transfer constraints cost user constraints compute constraints cpu availability processor availability ptomo-to-writer bandwidth subnet-to-writer bandwidth Multiple mixed-integer programs  approx soln

17 Experiments Impact of dynamic information on scheduler performance Usefulness of tunability Grid environments Scheduling latency

18 Dynamic Information We fix the triple and let schedulers determine work allocation

19 Evaluate schedulers –Repeatibility –Long makespan –several resource environments Simgrid (Casanova [CCGrid’2001]) –API for evaluating scheduling algorithms tasks resources modeled using traces –E.g. Parameter sweep applications [HCW’00] Simtomo Simulation

20 relative refresh lateness expected refresh period actual refresh period Relative refresh lateness Performance Metric

21 NCMIR experiments Traces (8 machines) –8 hour work day on March 8th, 2001 Ran simulations throughout day at 10 minute intervals 8:00 am 4:00 pm

22 Perfect Load Predictions hours since 3/8/ :00 PST mean relative refresh lateness wwa wwa+cpu wwa+bw AppLeS

23 Imperfect Load Predictions Student Version of MATLAB

24 Synthetic Grids Bandwidth predictibility –Average prediction error –p i  {L, M, H} –p 1 p 2 p 3 e.g. LMH –27 types –2510 Grids x 4 schedulers –10,040 simulations p1p1 p2p2 p3p3

25 Relative Scheduler Performance Student Version of MATLAB

26 Partial Ordering Performance vs. bandwidth predictability Grid predictibility –Partial orders using p 1 p 2 p 3 –Comparable/Not Comparable e.g. HML is comparable to HLL e.g. HLM is not comparable to LHM HHH, HHM, HMM, HLM, MLM, LLM, LLL

27 Example Partial Order HHHHHMHMMHLMMLMLLMLLL relative refresh lateness (seconds) wwa wwa+cpu wwa+bw AppLeS

28 Tunability Experiments How useful is tunability? –variability Fixed topology –categorized traces L, M, H –v 1 v 2 v 3 v 4 v 5 –243 Grid types v2v2 v1v1 v3v3 v4v4 v5v5

29 Tunability Experiments Run over a 2 day period –back-to-back –assume single user model f, r, su Set of triples chosen –T = {1,…,61}

30 Tunability Results fraction of changes parameters f r su Count how many times a triple changed per 2-day simulation e.g. –12.9% –25.7%

31 Scheduling Latency Time to search for feasible triples e.g. –88% under 1 sec –63% under 1 sec

32 Conclusions and Future Work Grid-enabled version of on-line parallel tomography –Tunable application Tunability is useful in Grid environments –User-directed AppLeS Importance of bandwidth predictability –e.g. rescheduling Scheduling latency is nominal Production use

33 Search optimization (f min,r min,su min ) (f max,r max,su max ) (f min,r min ) (f max,r max ) (f min,su min ) (f max,su max ) (r min,su min )