Towards Intelligent Workflow Planning for Neuroimaging Analyses Irfan Habib, Ashiq Anjum, Peter Bloodsworth, Richard McClatchey Centre for Complex Cooperative.

Slides:



Advertisements
Similar presentations
A Workflow Engine with Multi-Level Parallelism Supports Qifeng Huang and Yan Huang School of Computer Science Cardiff University
Advertisements

Rolls-Royce supported University Technology Centre in Control and Systems Engineering UK e-Science DAME Project Alex Shenfield
Interactive Evolutionary Computation Review of Applications Praminda Caleb-Solly Intelligent Computer Systems Centre University of the West of England.
Dynamic Task Assignment Load Index for Geographically Distributed Web Services PhD Research Proposal By: Dhiah Al-Shammary Supervised.
2009 – E. Félix Security DSL Toward model-based security engineering: developing a security analysis DSML Véronique Normand, Edith Félix, Thales Research.
A Local-Optimization based Strategy for Cost-Effective Datasets Storage of Scientific Applications in the Cloud Many slides from authors’ presentation.
The Big Picture Scientific disciplines have developed a computational branch Models without closed form solutions solved numerically This has lead to.
Feature Grouping-Based Fuzzy-Rough Feature Selection Richard Jensen Neil Mac Parthaláin Chris Cornelis.
Using Parallel Genetic Algorithm in a Predictive Job Scheduling
IoP HEPP 2004 Birmingham, 7/4/04 David Cameron, University of Glasgow 1 Simulation of Replica Optimisation Strategies for Data.
An Approach to Evaluate Data Trustworthiness Based on Data Provenance Department of Computer Science Purdue University.
University of Minho School of Engineering Centre Algoritmi Uma Escola a Reinventar o Futuro – Semana da Escola de Engenharia - 24 a 27 de Outubro de 2011.
Improving Robustness in Distributed Systems Jeremy Russell Software Engineering Honours Project.
1 Introduction to Load Balancing: l Definition of Distributed systems. Collection of independent loosely coupled computing resources. l Load Balancing.
Crossover Operation with Different Parents Crossover Operation with Identical Parents.
HW/SW Co-Synthesis of Dynamically Reconfigurable Embedded Systems HW/SW Partitioning and Scheduling Algorithms.
Optimal Arrangement of Ceiling Cameras for Home Service Robots Using Genetic Algorithms Stefanos Nikolaidis*, ** and Tamio Arai** *R&D Division, Square.
1 PSO-based Motion Fuzzy Controller Design for Mobile Robots Master : Juing-Shian Chiou Student : Yu-Chia Hu( 胡育嘉 ) PPT : 100% 製作 International Journal.
Introduction Due to the recent advances in smart grid as well as the increasing dissemination of smart meters, the electricity usage of every moment in.
By Paul Cottrell, BSc, MBA, ABD. Author Complexity Science, Behavioral Finance, Dynamic Hedging, Financial Statistics, Chaos Theory Proprietary Trader.
CS492: Special Topics on Distributed Algorithms and Systems Fall 2008 Lab 3: Final Term Project.
MapReduce VS Parallel DBMSs
Grid Data Management A network of computers forming prototype grids currently operate across Britain and the rest of the world, working on the data challenges.
©2009 Excel Experts. All rights reservedJune Johannesburg, South Africa Introduction An.
Focused Matrix Factorization for Audience Selection in Display Advertising BHARGAV KANAGAL, AMR AHMED, SANDEEP PANDEY, VANJA JOSIFOVSKI, LLUIS GARCIA-PUEYO,
An Autonomic Framework in Cloud Environment Jiedan Zhu Advisor: Prof. Gagan Agrawal.
A Survey of Distributed Task Schedulers Kei Takahashi (M1)
D1.HGE.CL7.01 D1.HGA.CL6.08 Slide 1. Introduction Design, prepare and present reports  Classroom schedule  Trainer contact details  Assessments  Resources:
A Model of the Environment for Flexible Access to Complex Distributed Applications Michal Kosiedowski
Embedding Constraint Satisfaction using Parallel Soft-Core Processors on FPGAs Prasad Subramanian, Brandon Eames, Department of Electrical Engineering,
Frankfurt (Germany), 6-9 June 2011 Steven Inglis – United Kingdom – RIF Session 5 – Paper 0434 Multi-Objective Network Planning tool for the optimal integration.
Reconfigurable Computing Using Content Addressable Memory (CAM) for Improved Performance and Resource Usage Group Members: Anderson Raid Marie Beltrao.
Bi-directional incremental evolution Dr Tatiana Kalganova Electronic and Computer Engineering Dept. Bio-Inspired Intelligent Systems Group Brunel University.
Optimizing Live Migration of Virtual Machines across Wide Area Networks using Integrated Replication and Scheduling Sumit Kumar Bose, Unisys Scott Brock,
IIIT Hyderabad Scalable Clustering using Multiple GPUs K Wasif Mohiuddin P J Narayanan Center for Visual Information Technology International Institute.
Taguchi. Abstraction Optimisation of manufacturing processes is typically performed utilising mathematical process models or designed experiments. However,
2013/12/09 Yun-Chung Yang Partitioning and Allocation of Scratch-Pad Memory for Priority-Based Preemptive Multi-Task Systems Takase, H. ; Tomiyama, H.
Copyright © 2006, UCD Dublin Systems Research Group School of Computer Science and Informatics UCD Dublin, Belfield, Dublin 4, Ireland
Task Graph Scheduling for RTR Paper Review By Gregor Scott.
George Goulas, Christos Gogos, Panayiotis Alefragis, Efthymios Housos Computer Systems Laboratory, Electrical & Computer Engineering Dept., University.
Computational Approaches for Biomarker Discovery SubbaLakshmiswetha Patchamatla.
Optimizing Live Migration of Virtual Machines across Wide Area Networks using Integrated Replication and Scheduling Sumit Kumar Bose, Unisys Scott Brock,
Big traffic data processing framework for intelligent monitoring and recording systems 學生 : 賴弘偉 教授 : 許毅然 作者 : Yingjie Xia a, JinlongChen a,b,n, XindaiLu.
Efficient Load Balancing Algorithm for Cloud Computing Network Che-Lun Hung 1, Hsiao-hsi Wang 2 and Yu-Chen Hu 2 1 Dept. of Computer Science & Communication.
CRISTAL Andrew Branson University of the West of England.
Workflows Description, Enactment and Monitoring in SAGA Ashiq Anjum, UWE Bristol Shantenu Jha, LSU 1.
Parallel Applications And Tools For Cloud Computing Environments CloudCom 2010 Indianapolis, Indiana, USA Nov 30 – Dec 3, 2010.
Evolving RBF Networks via GP for Estimating Fitness Values using Surrogate Models Ahmed Kattan Edgar Galvan.
Data Consolidation: A Task Scheduling and Data Migration Technique for Grid Networks Author: P. Kokkinos, K. Christodoulopoulos, A. Kretsis, and E. Varvarigos.
Generic competencesDescription of the Competence Learning Competence The student  possesses the capability to evaluate and develop one’s own competences.
MSE Presentation 3 By Lakshmikanth Ganti Under the Guidance of Dr. Virgil Wallentine – Major Professor Dr. Paul Smith – Committee Member Dr. Mitch Neilsen.
Improving System Availability in Distributed Environments Sam Malek with Marija Mikic-Rakic Nels.
© Geodise Project, University of Southampton, Workflow Support for Advanced Grid-Enabled Computing Fenglian Xu *, M.
Use of Performance Prediction Techniques for Grid Management Junwei Cao University of Warwick April 2002.
Research Traceability using Provenance Services for Biomedical Analysis Dr Peter Bloodsworth CCCS Research Centre UWE, Bristol, UK
EVOLUTIONARY SYSTEMS AND GENETIC ALGORITHMS NAME: AKSHITKUMAR PATEL STUDENT ID: GRAD POSITION PAPER.
 Negnevitsky, Pearson Education, Lecture 12 Hybrid intelligent systems: Evolutionary neural networks and fuzzy evolutionary systems n Introduction.
BAHIR DAR UNIVERSITY Institute of technology Faculty of Computing Department of information technology Msc program Distributed Database Article Review.
Alan P. Reynolds*, David W. Corne and Michael J. Chantler
Cluster formation based comparison of Genetic algorithm and Particle Swarm Optimization in Wireless Sensor Network Ms.Amita Yadav.
Model Execution Environment for Investigation of Heart Valve Diseases
Meteorological Big Data-as-a-Service: SOA based Environment and Methods for Meteorological Big Data Exploration Yaqiang Wang Chengdu University of Information.
Efficient Load Balancing Algorithm for Cloud
A Framework for Automatic Resource and Accuracy Management in A Cloud Environment Smita Vijayakumar.
Class project by Piyush Ranjan Satapathy & Van Lepham
Smita Vijayakumar Qian Zhu Gagan Agrawal
EPQ Learner Outcomes identify, design, plan and complete an individual project, applying a range of organisational skills and strategies to meet.
An Experimental Study of the Potential of Using Small
Lecture 4. Niching and Speciation (1)
M. Kezunovic (P.I.) S. S. Luo D. Ristanovic Texas A&M University
Presentation transcript:

Towards Intelligent Workflow Planning for Neuroimaging Analyses Irfan Habib, Ashiq Anjum, Peter Bloodsworth, Richard McClatchey Centre for Complex Cooperative Systems, BIT, University of the West of England, Bristol

Introduction Recent progress in neuroimaging techniques and data formats has led to an explosive growth in neuroimaging data Analysis of this data can facilitate research in neuro- degenerative diseases.

Clinical Users Commercial Partners Academic Partners

Neuroimaging datasets are generally processed through Neuroimaging pipelines

CIVET produces 1100% more data than it consumes, and intermediate data usage is more than 4000%. Without optimisation runtime of a single workflow is 8 hrs

CIVET Pipeline 85% of All Tasks in CIVET execute in less than 512 secs

CIVET Pipeline These 85% of tasks in CIVET perform just 8% of the computation

Existing Approaches State-of-the-art approaches for workflow planning include: Data-based Methods: Data elimination, data diffusion Task-based Approaches: Task Clustering Scheduling-based Approaches

Task Clustering Normalised Workflow turnaround time (with respect to standard CIVET on SGE Cluster) CIVET

Task Clustering Normalised Cumulative Data Retrieval (with respect to standard CIVET on SGE Cluster) CIVET

What are the issues? Different clustering strategies work for different types of workflows. A specific automated horizontal task clustering strategy created a computationally efficient workflow in this case.

Higher Data Affinity More Coarse Grained Tasks Fine-grained Tasks with Low-level of data-interdependencies Coarse-grained Tasks with High-level of data-interdependencies What are the issues?

Creating an efficient workflow plan involves consideration of several trade- offs! Various parameters need to be optimised: Data efficiency, scheduling latency, workflow turn-around time, network latencies. Hence workflow planning is a multi- dimensional optimisation problem. What are the issues?

This paper proposes an initial single- objective genetic algorithm based workflow planning approach.

B1B1 C2C2 C4C4 C3C3 B2B2 C3C3

B1B1 C4C4 C3C3 B1B1 C4C4 C3C3 B1B1 C4C4 C3C3 B1B1 C4C4 C3C3 B1B1 C4C4 C3C3 Enact Workflow GridGrid Provenance Storage Store Provenance Data Randomly Planned User Submitted Workflows B1B1 C2C2 C4C4 C3C3 B2B2 C3C3

Provenance Data Fitness Calculation SelectionSelection Genetic operators Pipeline Service Planner

The workflow planning approach will first be simulated in SimGRID. Various parameters for the planning approach will be tweaked and evaluated Type of selection producing the quickest convergence towards efficiency Extending fitness functions for multi- objectives Implementation of the Approach

Conclusion Several workflow planning techniques exist, however prior knowledge about the nature of the workflow is required to select an appropriate technique. This paper proposes a single-objective evolutionary workflow planning approach to optimise workflow turn- around times. The approach will be first implemented in a SimGrid environment and results will be shared in future publications.