HPSA18: Logistics 7:00 am – 8:00 am Breakfast

Slides:



Advertisements
Similar presentations
Presentation at WebEx Meeting June 15,  Context  Challenge  Anticipated Outcomes  Framework  Timeline & Guidance  Comment and Questions.
Advertisements

Priority Research Direction (I/O Models, Abstractions and Software) Key challenges What will you do to address the challenges? – Develop newer I/O models.
Clouds from FutureGrid’s Perspective April Geoffrey Fox Director, Digital Science Center, Pervasive.
OVERVIEW OF NETWORKING RESEARCH IN NETLAB 1 Dr. Jim Martin Associate Professor School of Computing Clemson University
NGNS Program Managers Richard Carlson Thomas Ndousse ASCAC meeting 11/21/2014 Next Generation Networking for Science Program Update.
SALSASALSASALSASALSA Digital Science Center June 25, 2010, IIT Geoffrey Fox Judy Qiu School.
Iterative computation is a kernel function to many data mining and data analysis algorithms. Missing in current MapReduce frameworks is collective communication,
1 Building National Cyberinfrastructure Alan Blatecky Office of Cyberinfrastructure EPSCoR Meeting May 21,
Big Data and Clouds: Challenges and Opportunities NIST January Geoffrey Fox
4.x Performance Technology drivers – Exascale systems will consist of complex configurations with a huge number of potentially heterogeneous components.
Slide 1 Auburn University Computer Science and Software Engineering Scientific Computing in Computer Science and Software Engineering Kai H. Chang Professor.
Science Clouds and FutureGrid’s Perspective June Science Clouds Workshop HPDC 2012 Delft Geoffrey Fox
Extreme scale parallel and distributed systems – High performance computing systems Current No. 1 supercomputer Tianhe-2 at petaflops Pushing toward.
Data Science at Digital Science October Geoffrey Fox Judy Qiu
Scientific Computing Environments ( Distributed Computing in an Exascale era) August Geoffrey Fox
4.2.1 Programming Models Technology drivers – Node count, scale of parallelism within the node – Heterogeneity – Complex memory hierarchies – Failure rates.
51 Detailed Use Cases: Contributed July-September 2013 Covers goals, data features such as 3 V’s, software, hardware
Comprehensive Scientific Support Of Large Scale Parallel Computation David Skinner, NERSC.
SALSASALSASALSASALSA Digital Science Center February 12, 2010, Bloomington Geoffrey Fox Judy Qiu
Big Data to Knowledge Panel SKG 2014 Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China August Geoffrey Fox
Big Data Open Source Software and Projects ABDS in Summary II: Layer 5 I590 Data Science Curriculum August Geoffrey Fox
SALSASALSA Large-Scale Data Analysis Applications Computer Vision Complex Networks Bioinformatics Deep Learning Data analysis plays an important role in.
Indiana University Faculty Geoffrey Fox, David Crandall, Judy Qiu, Gregor von Laszewski Data Science at Digital Science Center.
1 Panel on Merge or Split: Mutual Influence between Big Data and HPC Techniques IEEE International Workshop on High-Performance Big Data Computing In conjunction.
Geoffrey Fox Panel Talk: February
Panel: Beyond Exascale Computing
Penn State Center for e-Design Site Vision and Capabilities
Organizations Are Embracing New Opportunities
Introduction to Parallel Computing: MPI, OpenMP and Hybrid Programming
Digital Science Center II
Geoffrey Fox, Shantenu Jha, Dan Katz, Judy Qiu, Jon Weissman
Status and Challenges: January 2017
HPC Cloud Convergence February 2017 Software: MIDAS HPC-ABDS
NSF start October 1, 2014 Datanet: CIF21 DIBBs: Middleware and High Performance Analytics Libraries for Scalable Data Science Indiana University.
Data Management & Analysis in MATTER
Department of Intelligent Systems Engineering
NSF : CIF21 DIBBs: Middleware and High Performance Analytics Libraries for Scalable Data Science PI: Geoffrey C. Fox Software: MIDAS HPC-ABDS.
Department of Intelligent Systems Engineering
Digital Science Center I
I590 Data Science Curriculum August
High Performance Big Data Computing in the Digital Science Center
Data Science Curriculum March
Biology MDS and Clustering Results
Tutorial Overview February 2017
Data Science for Life Sciences Research & the Public Good
Hilton Hotel Honolulu Tapa Ballroom 2 June 26, 2017 Geoffrey Fox
Martin Swany Gregor von Laszewski Thomas Sterling Clint Whaley
Research in Digital Science Center
Cloud Evolution Dennis Gannon
Scalable Parallel Interoperable Data Analytics Library
Clouds from FutureGrid’s Perspective
Tor Skeie Feroz Zahid Simula Research Laboratory 27th June 2018
Discussion: Cloud Computing for an AI First Future
Digital Science Center III
Department of Intelligent Systems Engineering
Digital Science Center
$1M a year for 5 years; 7 institutions Active:
Brian Matthews STFC EOSCpilot Brian Matthews STFC
FY19 LEARNING PRIORITIES
3 Questions for Cluster and Grid Use
PHI Research in Digital Science Center
Defining the Grid Fabrizio Gagliardi EMEA Director Technical Computing
Summary of Grid Portal Architecture Workshop March Tokyo GGF7
PolarGrid and FutureGrid
Panel on Research Challenges in Big Data
Big Data, Simulations and HPC Convergence
Geoffrey Fox High-Performance Big Data Computing: International, National, and Local initiatives COLLABORATORS China and IU: Fudan University, SICE, OVPR.
Research in Digital Science Center
Convergence of Big Data and Extreme Computing
I590 Data Science Curriculum August
Presentation transcript:

HPSA18: Logistics 7:00 am – 8:00 am Breakfast Luddy Room 3166 (3rd floor) 10:00 am – 10:30 am Break 12:00 pm – 1:00 pm Lunch Community Center (1st floor) 2:40 pm – 3:00 pm 4:30 pm – 6:00 pm Panel Session 6:30 pm – 8:30 pm Dinner Finch’s Brasserie Restaurant 

Speakers Rick Van Kooten OVPR, Judy Qiu IU, and X. Sean Wang Fudan: Introduction Linton Ward, IBM Building the cognitive platform to accelerate innovation (Keynote 1) Nathan Greeneltch Intel, Learn Faster with Intel Data Analytics Acceleration Library (DAAL) Takuya Araki NEC, SX-Aurora TSUBASA and its application to machine learning Anthony Skjellum UTC MPI and MPI-like Middleware in the Age of HPC Analytics Andrew Younge Sandia, Supporting High Performance Analysts with System Software for Virtualized Supercomputing Anil Vullikanti Virginia Tech, Finding Trees and Anomalous Subgraphs in Parallel Albert Jonathan Univ. of Minnesota, Geo-Distributed Clouds For Data Analytics Tony Hey UK STFC Big Scientific Data and Data Science (Keynote2) Piotr Luszczek Univ. of Tenessee, HPC Autotuning Techniques for Computational Kernels in Data Analytics Scott Michael UITS, IU Big Data Infrastructure Wo Chang NIST, NIST PWG Big Data Reference Architecture for HPC and Analytics X. Sean Wang Fudan University, Overview including Astronomy (SKA) Data Analysis Weihua Zhang Fudan, Eunomia: Scaling Concurrent Search Trees under Contention Using HTM Martin Swany IU, Hardware-Accelerated Network Microservices for Big Data and Extreme Scale Computing Geoffrey Fox IU, High Performance Big Data Computing in the Digital Science Center Panel Session (Chair: Dennis Gannon) on 5 year challenges in HPSA

HPSA18: Timer 20 Minutes presentation including Q/A 5 minutes reminder

A path to future Artificial General Intelligence HPSA18: A path to future Artificial General Intelligence Theory Experiments or Observation Simulation of Theory or model Supercomputers Data-Driven or The Fourth Paradigm: Data-Intensive Scientific Discovery (aka Data Science)

HPSA Vision HPSA involves Large-Scale Data Analytics on High-Performance Computing (HPC) clusters optimized for data analysis. HPSA has been identified by Gartner in their infrastructure strategies priority matrix under the rubric of Hyperscale computing, as having transformational importance with their top rating in the 5-10 year timeframe. The potential impact on scientific discovery and economic development from Data Analytics is tremendous. The workshop features innovative research and development in hardware, algorithms and software for big data systems of transformational capability on computer architectures ranging from commodity clouds, hybrid HPC-clouds, and supercomputers. It aims at performance and security that scales and fully exploit the specialized features (communication, memory, energy, I/O, accelerator) of each different architecture. Studies of new architectures and benchmarking of existing systems. Applications will range from pleasingly parallel, MapReduce, to Machine Learning (e.g., Random Forest, SVM, Latent Dirichlet Allocation, Clustering and Dimension Reduction), Deep Learning, and Large Graph Analytics.

Cloud Computing Spending $

Gartner: Hype Cycle for Emerging Technologies, 2017 Published: 21 July 2017 ID: G00314560 Analyst(s): Mike J. Walker

Gartner: Priority Matrix for Emerging Technologies2017 Hype Cycle for Emerging Technologies, 2017 Published: 21 July 2017 ID: G00314560 Analyst(s): Mike J. Walker

Gartner: Hype Cycle for Cloud Computing, 2017 Published: 01 August 2017 ID: G00315206 Analyst(s): David Mitchell Smith | Ed Anderson

Different choices in software systems in Clouds and HPC HPC-ABDS takes cloud software augmented by HPC when needed to improve performance 16 of 21 layers plus languages

Figure. Cloud-HPC interoperable software for High Performance Big Data Analytics at Scale

Difficulty in Parallelism

Building a Community of Communities First BDECng meeting: Indiana University Bloomington October 3-5 2018 The BDECng community is engaged in a shaping strategy and process that builds on the collective expertise of an expanded community: New application domains, Infrastructure providers, Technology community.