Numerical Libraries Project Microsoft Incubation Group Mary Beth Hribar Microsoft Corporation CSCAPES Workshop June 10, 2008 Copyright Microsoft Corporation,

Slides:



Advertisements
Similar presentations
GPE4UNICORE Grid Programming Environment for UNICORE
Advertisements

Refining High Performance FORTRAN Code from Programming Model Dependencies Ferosh Jacob University of Alabama Department of Computer Science
Statistical Modeling of Feedback Data in an Automatic Tuning System Richard Vuduc, James Demmel (U.C. Berkeley, EECS) Jeff.
Technology Drivers Traditional HPC application drivers – OS noise, resource monitoring and management, memory footprint – Complexity of resources to be.
Software & Services Group, Developer Products Division Copyright© 2010, Intel Corporation. All rights reserved. *Other brands and names are the property.
Priority Research Direction Key challenges General Evaluation of current algorithms Evaluation of use of algorithms in Applications Application of “standard”
Software Engineering 1. Software development – the grand view 2. Requirements engineering.
The Who, What, Why and How of High Performance Computing Applications in the Cloud Soheila Abrishami 1.
Seminar On “ OMNET++ Network Simulator” Presented By: Saurav K Bengani Guided By: Guided By: Dr. Andrew yang Dr. Andrew yang.
1 Coven a Framework for High Performance Problem Solving Environments Nathan A. DeBardeleben Walter B. Ligon III Sourabh Pandit Dan C. Stanzione Jr. Parallel.
Sixth Edition 1 M a n a g e m e n t I n f o r m a t i o n S y s t e m s M a n a g I n g I n f o r m a t i o n T e c h n o l o g y i n t h e E – B u s i.
Copyright HiPERiSM Consulting, LLC, George Delic, Ph.D. HiPERiSM Consulting, LLC (919) P.O. Box 569, Chapel Hill,
Parallelized variational EM for Latent Dirichlet Allocation: An experimental evaluation of speed and scalability Ramesh Nallapati, William Cohen and John.
Building Enterprise Applications Using Visual Studio ®.NET Enterprise Architect.
6/14/2015 How to measure Multi- Instruction, Multi-Core Processor Performance using Simulation Deepak Shankar Darryl Koivisto Mirabilis Design Inc.
Software Group © 2006 IBM Corporation Compiler Technology Task, thread and processor — OpenMP 3.0 and beyond Guansong Zhang, IBM Toronto Lab.
May 25, 2010 Mary Hall May 25, 2010 Advancing the Compiler Community’s Research Agenda with Archiving and Repeatability * This work has been partially.
Presenter: Joshan V John Robert Dyer, Hoan Anh Nguyen, Hridesh Rajan & Tien N. Nguyen Iowa State University, USA Instructor: Christoph Csallner 1 Joshan.
Trigger and online software Simon George & Reiner Hauser T/DAQ Phase 1 IDR.
Leveling the Field for Multicore Open Systems Architectures Markus Levy President, EEMBC President, Multicore Association.
Efficient Hardware dependant Software (HdS) Generation using SW Development Platforms Frédéric ROUSSEAU CASTNESS‘07 Computer Architectures and Software.
Windows Server 2008 R2 and IT Challenges Windows Server Solution Accelerators Microsoft Assessment and Planning Toolkit 4.0 Next.
Sandia is a multiprogram laboratory operated by Sandia Corporation, a Lockheed Martin Company, for the United States Department of Energy’s National Nuclear.
Introduction to Information Retrieval CS 5604: Information Storage and Retrieval ProjCINETViz by Maksudul Alam, S M Arifuzzaman, and Md Hasanuzzaman Bhuiyan.
Priority Research Direction Key challenges Fault oblivious, Error tolerant software Hybrid and hierarchical based algorithms (eg linear algebra split across.
OpenMP in a Heterogeneous World Ayodunni Aribuki Advisor: Dr. Barbara Chapman HPCTools Group University of Houston.
03/12/20101 Analysis of FPGA based Kalman Filter Architectures Arvind Sudarsanam Dissertation Defense 12 March 2010.
PROGRAMMING IN VISUAL BASIC.NET INTRODUCTION TO VISUAL BASIC.NET Bilal Munir Mughal 1 Chapter-1.
1 Discussions on the next PAAP workshop, RIKEN. 2 Collaborations toward PAAP Several potential topics : 1.Applications (Wave Propagation, Climate, Reactor.
LLNL-PRES-XXXXXX This work was performed under the auspices of the U.S. Department of Energy by Lawrence Livermore National Laboratory under Contract DE-AC52-07NA27344.
ICOM 5995: Performance Instrumentation and Visualization for High Performance Computer Systems Lecture 7 October 16, 2002 Nayda G. Santiago.
A Metadata Based Approach For Supporting Subsetting Queries Over Parallel HDF5 Datasets Vignesh Santhanagopalan Graduate Student Department Of CSE.
OBJECT ORIENTED SYSTEM ANALYSIS AND DESIGN. COURSE OUTLINE The world of the Information Systems Analyst Approaches to System Development The Analyst as.
Tessellation: Space-Time Partitioning in a Manycore Client OS Rose Liu 1,2, Kevin Klues 1, Sarah Bird 1, Steven Hofmeyr 3, Krste Asanovic 1, John Kubiatowicz.
Offline Coordinators  CMSSW_7_1_0 release: 17 June 2014  Usage:  Generation and Simulation samples for run 2 startup  Limited digitization and reconstruction.
4.2.1 Programming Models Technology drivers – Node count, scale of parallelism within the node – Heterogeneity – Complex memory hierarchies – Failure rates.
HPC User Forum Back End Compiler Panel SiCortex Perspective Kevin Harris Compiler Manager April 2009.
Lixia Liu, Zhiyuan Li Purdue University, USA PPOPP 2010, January 2009.
I n t e g r i t y - S e r v i c e - E x c e l l e n c e UML to OPNET SW Performance Modeling November 6, 2008 John James (E547) Greg Quinn (E547) Ed Walters.
Building a real-world, Internet- scale stock trading application Naveen Prabhu Quadwave Consulting Pvt. Ltd.
Software Engineering 1 Object-oriented Analysis and Design Applying UML and Patterns An Introduction to Object-oriented Analysis and Design and Iterative.
© Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice. 1 Automate your way to.
Linear Algebra Libraries: BLAS, LAPACK, ScaLAPACK, PLASMA, MAGMA
 Programming - the process of creating computer programs.
Platform Abstraction Group 3. Question How to deal with different types hardware and software platforms? What detail to expose to the programmer? What.
MPEG-4 Structured Audio Mihir Anandpara EE 382C – Embedded Software Systems.
Comprehensive Scientific Support Of Large Scale Parallel Computation David Skinner, NERSC.
Computing Systems: Next Call for Proposals Dr. Panagiotis Tsarchopoulos Computing Systems ICT Programme European Commission.
Professor Arthur Trew Director, EPCC EPCC: KT in novel computing.
C OMPUTATIONAL R ESEARCH D IVISION 1 Defining Software Requirements for Scientific Computing Phillip Colella Applied Numerical Algorithms Group Lawrence.
Building Composable Parallel Software with Liquid Threads Heidi Pan*, Benjamin Hindman +, Krste Asanovic + *MIT, + UC Berkeley Microsoft Numerical Library.
EU-Russia Call Dr. Panagiotis Tsarchopoulos Computing Systems ICT Programme European Commission.
Linear Algebra Libraries: BLAS, LAPACK, ScaLAPACK, PLASMA, MAGMA Shirley Moore CPS5401 Fall 2013 svmoore.pbworks.com November 12, 2012.
ISCUG Keynote May 2008 Acknowledgements to the TI-Nokia ESL forum (held Jan 2007) and to James Aldis, TI and OSCI TLM WG Chair 1 SystemC: Untapped Value.
Introduction to Performance Tuning Chia-heng Tu PAS Lab Summer Workshop 2009 June 30,
B ERKELEY P AR L AB Lithe Composing Parallel Software Efficiently PLDI  June 09, 2010 Heidi Pan, Benjamin Hindman, Krste Asanovic  {benh,
VisIt Project Overview
Benchmarking Deep Learning Inference
Enabling Effective Utilization of GPUs for Data Management Systems
Welcome: Intel Multicore Research Conference
Many-core Software Development Platforms
Chapter 3: Windows7 Part 1.
F# for Parallel and Asynchronous Programming
Analysis models and design models
Bin Ren, Gagan Agrawal, Brad Chamberlain, Steve Deitz
Introduction to Virtual Machines
Introduction to Virtual Machines
Executable Specifications
Rohan Yadav and Charles Yuan (rohany) (chenhuiy)
Presentation transcript:

Numerical Libraries Project Microsoft Incubation Group Mary Beth Hribar Microsoft Corporation CSCAPES Workshop June 10, 2008 Copyright Microsoft Corporation, All Rights Reserved

Microsoft Incubation Group John Manferdelli OS, Resource Management Security, Partitioning, Virtualization Application & Libraries Incubation Group Craig Mundie Steve BallmerBill Gates Microsoft Research Research Organization Copyright Microsoft Corporation, All Rights Reserved

Many-Core Collaboration Microsoft effort to realize the performance of many-core architectures in their products Widespread internal and external projects to produce complete parallel programming software solution to customers Copyright Microsoft Corporation, All Rights Reserved

Many-Core Software Solution Programming Models and Languages Domain Libraries Applications Runtime, Platform and Operating System Copyright Microsoft Corporation, All Rights Reserved

Numerical Libraries Project Project in Incubation Group starting Jan, 2008 Investigate numerical libraries for single node many-core architectures – Develop an initial parallel numerical library specifically for Microsoft platforms – Demonstrate use of library in parallel applications – Contribute to adoption of Microsoft platforms on future many-core and HPC architectures Copyright Microsoft Corporation, All Rights Reserved

Tasks Investigate small set of functionality Implement routines for Microsoft platform Consider design of interfaces Analyze performance Copyright Microsoft Corporation, All Rights Reserved

Functional Areas Investigate “core” math functions Initially target dense and sparse linear algebra Will be looking at other areas – Graph algorithms – Random number generators – FFTs Applications drive priority of routines to implement Copyright Microsoft Corporation, All Rights Reserved

Implementation Implement libraries for Microsoft platforms Write versions for native and managed code – Native: C++ – Managed: C# Evaluate parallel programming models Copyright Microsoft Corporation, All Rights Reserved

Interfaces Investigate the design of managed code interfaces – “Expert” interfaces allow users to access all algorithmic and performance parameters – “Productivity” interfaces allow users to ignore low level details Copyright Microsoft Corporation, All Rights Reserved

Performance Analyze performance of library routines – Scaling performance of algorithms – Performance of native vs. managed code – Performance of programming models Demonstrate performance of libraries in applications Copyright Microsoft Corporation, All Rights Reserved

Challenges Scalable performance on range of many-core architectures – Varying number of cores, memory bandwidth, layout, etc. – Successive generations of hardware – Small and large input sizes Target Microsoft software environments – Existing libraries and parallel applications written in native code – Libraries for Microsoft’s managed code environments may have security requirements which impact performance – Potentially mixing programming models within one application Numeric accuracy and stability Copyright Microsoft Corporation, All Rights Reserved

Collaborations Within Microsoft – Evaluate early releases of programming models and runtime – Gather application requirements and plan demonstrations Research groups in academia – Share new many-core parallel algorithm ideas – UTK, Berkeley, and looking for others Copyright Microsoft Corporation, All Rights Reserved

Thank you! Copyright Microsoft Corporation, All Rights Reserved