Extreme scale parallel and distributed systems – High performance computing systems Current No. 1 supercomputer Tianhe-2 at 33.86 petaflops Pushing toward.

Slides:



Advertisements
Similar presentations
Large Scale Computing Systems
Advertisements

SALSA HPC Group School of Informatics and Computing Indiana University.
HadoopDB Inneke Ponet.  Introduction  Technologies for data analysis  HadoopDB  Desired properties  Layers of HadoopDB  HadoopDB Components.
Towards an Exa-scale Operating System* Ely Levy, The Hebrew University *Work supported in part by a grant from the DFG program SPPEXA, project FFMK.
2. Computer Clusters for Scalable Parallel Computing
The Who, What, Why and How of High Performance Computing Applications in the Cloud Soheila Abrishami 1.
1 Grid Modernization – A Strategic Imperative for 2050 Advanced Energy Conference May 1, 2013 By Carl Imhoff Electric Infrastructure Sector Manager Pacific.
OCIN Workshop Wrapup Bill Dally. Thanks To Funding –NSF - Timothy Pinkston, Federica Darema, Mike Foster –UC Discovery Program Organization –Jane Klickman,
NGNS Program Managers Richard Carlson Thomas Ndousse ASCAC meeting 11/21/2014 Next Generation Networking for Science Program Update.
Undergraduate Poster Presentation Match 31, 2015 Department of CSE, BUET, Dhaka, Bangladesh Wireless Sensor Network Integretion With Cloud Computing H.M.A.
WORKFLOWS IN CLOUD COMPUTING. CLOUD COMPUTING  Delivering applications or services in on-demand environment  Hundreds of thousands of users / applications.
High Performance Computing with cloud Xu Tong. About the topic Why HPC(high performance computing) used on cloud What’s the difference between cloud and.
Chapter 2 Computer Clusters Lecture 2.1 Overview.
Parallelization: Conway’s Game of Life. Cellular automata: Important for science Biology – Mapping brain tumor growth Ecology – Interactions of species.
Iterative computation is a kernel function to many data mining and data analysis algorithms. Missing in current MapReduce frameworks is collective communication,
Jawwad A Shamsi Nouman Durrani Nadeem Kafi Systems Research Laboratories, FAST National University of Computer and Emerging Sciences, Karachi Novelties.
Big Data and Clouds: Challenges and Opportunities NIST January Geoffrey Fox
Cloud MapReduce : a MapReduce Implementation on top of a Cloud Operating System Speaker : 童耀民 MA1G Authors: Huan Liu, Dan Orban Accenture.
Slide 1 Auburn University Computer Science and Software Engineering Scientific Computing in Computer Science and Software Engineering Kai H. Chang Professor.
K E Y : SW Service Use Big Data Information Flow SW Tools and Algorithms Transfer Application Provider Visualization Access Analytics Curation Collection.
COLUMBIA UNIVERSITY Department of Electrical Engineering The Fu Foundation School of Engineering and Applied Science IN THE CITY OF NEW YORK Networking.
Principles of Scalable HPC System Design March 6, 2012 Sue Kelly Sandia National Laboratories Abstract: Sandia National.
Workload Optimized Processor
Power Distribution and Redistribution of Workloads in Cloud Computing Facilities Cornell Wilson.
Face Detection And Recognition For Distributed Systems Meng Lin and Ermin Hodžić 1.
Parallel and Distributed Systems Instructor: Xin Yuan Department of Computer Science Florida State University.
Introduction to Apache Hadoop Zibo Wang. Introduction  What is Apache Hadoop?  Apache Hadoop is a software framework which provides open source libraries.
Extreme-scale computing systems – High performance computing systems Current No. 1 supercomputer Tianhe-2 at petaflops Pushing toward exa-scale computing.
Gilad Shainer, VP of Marketing Dec 2013 Interconnect Your Future.
Cluster Workstations. Recently the distinction between parallel and distributed computers has become blurred with the advent of the network of workstations.
Loosely Coupled Parallelism: Clusters. Context We have studied older archictures for loosely coupled parallelism, such as mesh’s, hypercubes etc, which.
4.2.1 Programming Models Technology drivers – Node count, scale of parallelism within the node – Heterogeneity – Complex memory hierarchies – Failure rates.
Directed Reading 2 Key issues for the future of Software and Hardware for large scale Parallel Computing and the approaches to address these. Submitted.
Eneryg Efficiency for MapReduce Workloads: An Indepth Study Boliang Feng Renmin University of China Dec 19.
SALSA HPC Group School of Informatics and Computing Indiana University.
Agenda Motion Imagery Challenges Overview of our Cloud Activities -Big Data -Large Data Implementation Lessons Learned Summary.
K E Y : SW Service Use Big Data Information Flow SW Tools and Algorithms Transfer Transformation Provider Visualization Access Analytics Curation Collection.
Requirements of Programming Models ● Requirements never change:  Programmability  Performance ● Satisfying these requirements, of course is a function.
CS525: Big Data Analytics MapReduce Computing Paradigm & Apache Hadoop Open Source Fall 2013 Elke A. Rundensteiner 1.
Partitioned Multistack Evironments for Exascale Systems Jack Lange Assistant Professor University of Pittsburgh.
© Copyright 2013 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice. Big Data Directions Greg.
Interconnection network network interface and a case study.
| nectar.org.au NECTAR TRAINING Module 4 From PC To Cloud or HPC.
A Scalable Two-Phase Top-Down Specialization Approach for Data Anonymization Using MapReduce on Cloud.
1 WRB 09/02 HPEC Lincoln Lab Sept 2002 Poster B: Software Technologies andSystems W. Robert Bernecky Naval Undersea Warfare Center Ph: (401) Fax:
Big Data Open Source Software and Projects ABDS in Summary II: Layer 5 I590 Data Science Curriculum August Geoffrey Fox
PARALLEL AND DISTRIBUTED PROGRAMMING MODELS U. Jhashuva 1 Asst. Prof Dept. of CSE om.
Distributed Process Discovery From Large Event Logs Sergio Hernández de Mesa {
A Classification for Access Control List To Speed Up Packet-Filtering Firewall CHEN FAN, LONG TAN, RAWAD FELIMBAN and ABDELSHAKOUR ABUZNEID Department.
Usage Of Cloud Computing Simulators And Future Systems In Computational Research Dr. Ramkumar Lakshminarayanan Mr. Rajasekar Ramalingam.
Leverage Big Data With Hadoop Analytics Presentation by Ravi Namboori Visit
Percipient StorAGe for Exascale Data Centric Computing Exascale Storage Architecture based on “Mero” Object Store Giuseppe Congiu Seagate Systems UK.
F1-17: Architecture Studies for New-Gen HPC Systems
Panel: Beyond Exascale Computing
Organizations Are Embracing New Opportunities
DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING CLOUD COMPUTING
Networking & Communications Prof. Javad Ghaderi
ICT NCP Infoday Brussels, 23 June 2010
Algorithms for Big Data Delivery over the Internet of Things
Low Latency Analytics HPC Clusters
Big Data - in Performance Engineering
Scalable Parallel Interoperable Data Analytics Library
Tor Skeie Feroz Zahid Simula Research Laboratory 27th June 2018
Software Acceleration in Hybrid Systems Xiaoqiao (XQ) Meng IBM T. J
Cloud Computing Large-scale Resource Management
CIS5930: Advanced Topics in Parallel and Distributed Systems
Panel on Research Challenges in Big Data
Distributed Systems (15-440)
Motivation Contemporary big data tools such as MapReduce and graph processing tools have fixed data abstraction and support a limited set of communication.
Convergence of Big Data and Extreme Computing
Presentation transcript:

Extreme scale parallel and distributed systems – High performance computing systems Current No. 1 supercomputer Tianhe-2 at petaflops Pushing toward exa-scale computing by 2020, 32 times bigger than Tianhe-2 (almost need to double the speed every year). Many issues ranging from applications to systems such power, resilience, networking, applications.

Extreme scale parallel and distributed systems – Cloud computing data centers: Amazon EC2 Hugh push to move computing/storage to the cloud computing infrastructure Extreme scale to achieve the scale of economics Applications are more diverse – Networking infrastructure needs significant improvement – Security

Extreme scale parallel and distributed systems – Big data platforms: hadoop cluster? Huge hype Not clear what is beyond the traditional HPC and cloud computing platforms.

Issues related to extreme scale systems How to use the systems – Programming paradigms What changes when the scale becomes big? How to build the systems – Hardware and systems issues What changes when the scale becomes big?

Programming for extreme scale PDS Ease of use.vs. performance Distributed memory programming – Message Passing Interface (MPI) – Mapreduce (Hadoop) Hybrid shared memory and distributed memory programming – Matching the architecture -- CMP+SMP clusters – Hybrid OpenMP+MPI GPU/MIC programming and hybrid programming – More potential to achieve exa-scale within power limit – GPU, MIC – Hybrid GPU/MIC + MPI

Architecture/interconnects Extreme-scale PDSs are Internet-in-a-building – Traditional networking issues: topology, routing, flow control, congestion control

Architecture/interconnects Current and Emerging network architectures – InfiniBand and 10/100-G E (technology) – Openflow and software defined networks (network architecture) – Recent topology/routing proposals for extreme scale systems Achieving performance requirement with the budget constraints.

System software and communication sub-systems – Parallel IO systems – Topology aware job allocation and node mapping – Communication protocols – One-sided.vs. two-sided communications – Collective communication algorithms

Performance models and evaluation methods Performance modeling techniques for networks/systems/applications. Workload characterization. Application tracing Challenges in simulation and modeling of large scale systems using realistic workloads

Resilience and power-awareness System and application resilience techniques and analysis Fault tolerance techniques in hardware and software Resource management for system resilience and availability. Energy efficient HPC Energy efficient data centers

This course Targets students who are interested in research and development in large scale sytems. – Go through the recent advances in these subjects, and bring you up-to-date in research in this area in general. – Introduce software, algorithmic, and analytical tools and techniques that are necessary to perform research in this area.