Please Note: IBM’s statements regarding its plans, directions, and intent are subject to change or withdrawal without notice at IBM’s sole discretion.

Slides:



Advertisements
Similar presentations
© 2013 Sri U-Thong Limited. All rights reserved. This presentation has been prepared by Sri U-Thong Limited and its holding company (collectively, “Sri.
Advertisements

© 2014 Microsoft Corporation. All rights reserved.
Innovations in Structured Products October 25, 2010 An Innovator’s Dilemma?
© 2010 IBM Corporation ® IBM Software Group Assistive Technology As applied to the workplace Niamh Foley.
Data Analysis and Visualization Dr. Frank van Ham, IBM Netherlands Target Conference 2014, Groningen Nov 4 th, 2014.
© 2014 IBM Corporation IBM Tivoli Storage Manager Virtual Appliance Smarter Data Protection for Cloud Environments Cyrus Niltchian, Product Management.
® IBM Software Group © 2009 IBM Corporation Updated August 15, 2015 WebSphere Enterprise Service Bus WebSphere Integration Developer Mediation.
Rajeev Gollapudi SAP Labs India Steven Pitschke IBM Rational
Click to add text © 2012 IBM Corporation 1 Streams Toolkit Landscape InfoSphere Streams Version 3.0 Mike Branson Toolkits.
® IBM Software Group © 2012 IBM Corporation OPTIM Data Studio – Jon Sayles, IBM/Rational November, 2012.
1 Mobile Document Capture using Apple iPhone and IBM Content Navigator October, 2012.
Title Slide – Option 1. Title Slide – Option 2 Insert Text.
End User License Agreement Permission to use and redistribute this Document is granted, provided that (1) the below copyright notice appears in all copies.
IBM Software Group AIM Enterprise Platform Software IBM z/Transaction Processing Facility Enterprise Edition © IBM Corporation 2005 TPF Users Group.
Building Cognitive Apps with IBM Watson on Bluemix
International Telecommunication Union New Delhi, India, December 2011 ITU Workshop on Standards and Intellectual Property Rights (IPR) Issues Philip.
The Drive to Improved Performance/watt and Increasing Compute Density Steve Pawlowski Intel Senior Fellow GM, Architecture and Planning CTO, Digital Enterprise.
© 2011 IBM Corporation January 2011 Pam Denny, IBM V7 Reporting.
Enhancement Package Innovations Gabe Rodriguez - Halliburton Stefan Kneis – SAP Marco Valencia - SAP.
Z/TPF EE V1.1 z/TPFDF V1.1 TPF Toolkit for WebSphere® Studio V3 TPF Operations Server V1.2 IBM Software Group AIM Enterprise Platform Software IBM z/Transaction.
Oracle Fusion Applications 11gR1 ( ) Functional Overview (L2) Manage Inbound Logistics (L3) Manage Receipts.
IBM Software Group AIM Enterprise Platform Software IBM z/Transaction Processing Facility Enterprise Edition © IBM Corporation 2005 TPF Users Group.
© 2015 IBM Corporation Big Data Journey. © 2015 IBM Corporation 2.
Oracle Fusion Applications 11gR1 ( ) Functional Overview (L2) Manage Inbound Logistics (L3) Manage Supplier Returns.
Click to add text © 2012 IBM Corporation 1 InfoSphere Streams Streams Console Applications InfoSphere Streams Version 3.0 Warren Acker InfoSphere Streams.
Oracle Fusion Applications 11gR1 ( ) Functional Overview (L2) Manage Inbound Logistics (L3) Manage and Disposition Inventory Returns.
IBM eServer iSeries © 2003 IBM Corporation ™™ iSeries Solutions for Business Continuity IBM eServerJ iSeriesJ © 2003 IBM Corporation.
Oracle Fusion Applications 11gR1 ( ) Functional Overview (L2) Manage Inbound Logistics (L3) Inspect Material.
Click to add text © 2012 IBM Corporation 1 Streams Console Application Graph Michael Pfeifer Streams Admin Console.
Oracle E-Business Suite R12.1 Accounts Payables Partner Boot Camp Training Courseware Part VIII – Transaction Taxes in Payables.
For Oracle employees and authorized partners only. Do not distribute to third parties. © 2008 Oracle Corporation – Proprietary and Confidential.
1 of 26 For Oracle employees and authorized partners only. Do not distribute to third parties. © 2009 Oracle Corporation – Proprietary and Confidential.
End User License Agreement Permission to use and redistribute this Document is granted, provided that (1) the below copyright notice appears in all copies.
Showdown at the Mobile Corral Tim Choo / Feb 2, 2016.
-1- For Oracle employees and authorized partners only. Do not distribute to third parties. © 2009 Oracle Corporation – Proprietary and Confidential Oracle.
1451 – Moving to IBM Verse & wondering how to leverage your existing Domino application infrastructure? Raj Patil, Senior Technical Staff Member February.
Click to edit Master subtitle style © 2015 IBM Corporation Liberty Elastic Clusters and Centralized Administration Using Scripting and Admin Center Lab.
I want stress-free IT. i want control. i want an i. IBM System i ™ Session: Secure Perspective Patrick Botz IBM Lab Services Security Architecture Consulting.
IBM Systems Group © 2004 IBM Corporationv 3.04 This presentation is intended for the education of IBM and Business Partner sales personnel. It should not.
1 © 2016 IBM Corporation Mobile Device Management Manage smartphones, tablets & laptops featuring iOS, Android, Windows Phone, BlackBerry, Windows PC &
IBM Innovate 2012 Title Presenter’s Name Presenter’s Title, Organization Presenter’s Address Session Track Number (if applicable)
IMPORTANT info regarding IBM speaker guidelines and disclaimers If your presentation has forward looking content, it is mandatory that you put the forward.
Copyright © 2012, Oracle and/or its affiliates. All rights reserved. Oracle Proprietary and Confidential. 1.
Work smarter, keep connected with Lotus Software Jon Crouch | Senior Technical Specialist, Lotus Software Matt Newton | Senior Technical Specialist, Lotus.
IBM mail support for MS Outlook Today, Tomorrow, Cloud and OnPrem Luis Guirigay WW Executive IT Specialist Barry Rosen.
Outthink threats The next era of security. Marc van Zadelhoff General Manager, IBM Security.
Please Note: IBM’s statements regarding its plans, directions, and intent are subject to change or withdrawal without notice at IBM’s sole discretion.
PowerAI Scott Soutter Offering Manager: PowerAI and High Performance Data Analytics
Using Parallelspace TEAM Models to Design and Create Custom Profiles
BigFix Patch for Linux Overview of the RHSM Download Plug-in and the Multiple-Package Baseline Installation feature Chuxin Zhao.
Virtualization Engine console Bridge Concepts
IBM System z9 109 Availability Eye Opener
Parallelspace PowerPoint Template for ArchiMate® 2.1 version 1.1
Parallelspace PowerPoint Template for ArchiMate® 2.1 version 2.0
Many-core Software Development Platforms
Go Off Grid ➔ Go Graph! Jason October 24, 2016
Connections AppDev: Building at the Speed of Pink
Apache Atlas October 2016.
IBM Blockchain An Enterprise Deployment of a Distributed Consensus-based Transaction Log Ben Smith & Kostantinos Christidis 1 ©2016 IBM Corporation.
IBM Global Technology Services
Motivation for 36OU Open Rack
What YOUR ORGANIZATION CAN be doing to prepare
IBM OpenPages Developer WYNTK
Global Technology Services
© 2013 Sri U-Thong Limited. All rights reserved
Presentation transcript:

Accelerating Machine Learning Applications on Spark Using GPUs Wei Tan, Liana Fong Other contributors: Minisk Cho, Rajesh Bordawekar T. J. Watson Research Center October 25

Please Note: IBM’s statements regarding its plans, directions, and intent are subject to change or withdrawal without notice at IBM’s sole discretion. Information regarding potential future products is intended to outline our general product direction and it should not be relied on in making a purchasing decision. The information mentioned regarding potential future products is not a commitment, promise, or legal obligation to deliver any material, code or functionality. Information about potential future products may not be incorporated into any contract. The development, release, and timing of any future features or functionality described for our products remains at our sole discretion. Performance is based on measurements and projections using standard IBM benchmarks in a controlled environment. The actual throughput or performance that any user will experience will vary depending upon many factors, including considerations such as the amount of multiprogramming in the user’s job stream, the I/O configuration, the storage configuration, and the workload processed. Therefore, no assurance can be given that an individual user will achieve results similar to those stated here.

Background: Apache Spark and MLlib An in memory engine for large-scale data processing Used in database, stream, machine learning and graph processing iter. 1 iter. 2 . . . Input

Background: Apache Spark and MLlib Recommendation Trees Classification (LR, SVM…) Clustering … …

Background: GPU computing Xeon e5 2687 CPU Tesla K40 GPU GPU is with: Slower clock, fewer cache: not optimized for latency More transistors to compute Higher flops and memory bw Optimized for data-parallel, high-throughput workload

Background: Apache Spark and MLlib + (GPU) connectors and libs? Recommendation Trees Classification (LR, SVM…) Clustering … …

Problem: large-scale matrix factorization Why Recommendation important in cognitive applications Digital ads market in US: 37.3 b*: Spark/Facebook/IBM Commerce Need a fast and scalable solution *Market research of commercial recommendation engines for online and offline retail. http://dspace.mit.edu/handle/1721.1/90218 . MIT, 2014

Problem: large-scale matrix factorization Why Factorize the word co-occurrence matrix as rating matrix Obtain word features that embeds semantics man – woman = king – queen = brother – sister …. *Market research of commercial recommendation engines for online and offline retail. http://dspace.mit.edu/handle/1721.1/90218 . MIT, 2014

MF: the state-of-art Many systems optimized for medium- sized problems; very few target at huge problems. Distributed solutions are slow. Do not roofline CPU performance Do not optimize communication Distributed solutions need a lot of resources and cost.

MF: what we what to achieve Scale to problems of any size. Fast. Cost-efficient.

Solution: cuMF - ALS on a machine with GPUs On one GPU GPU (Nvidia K40): Memory BW: 288 GB/sec, compute: 5 Tflops/sec Memory slower than compute  need to optimize memory access! The roofline model Higher Gflops  higher op intensity (more flops per byte)  caching! 5T 17 × Gflops/s 1 288G × Operational intensity (Flops/Byte)

Solution: cuMF - ALS on a machine with GPUs MO-ALS on one GPU: Memory-Optimized ALS Access many θv columns: irregular due to R’s sparseness Aggregate many θvθvTs: memory intensive

Solution: cuMF - ALS on a machine with GPUs Texture memory to smooth dis-contiguous, irregular memory access Register memory to hold hotspot variables

Solution: cuMF - ALS on a machine with GPUs On multiple GPUs Exploit data & model parallelism Data parallelism: solve using a portion of the training data Model parallelism: solve a portion of the model Exploit connection topology to minimize communication overhead Data parallel model parallel

CuMF performance

CuMF Performance cuMF: ALS on a single machine with 2* Nvidia K80 (4 cards) Compared with state-of-art distributed solutions 6-10x as fast 33-100x as cost-efficient (cuMF costs $2.5 per hour on Softlayer) Able to factorize the largest matrix ever reported

CuMF Performance cuMF: ALS on a machine with one GPU 4x speedup as Spark ALS accelerator cuMF with Spark Spark ALS cuMF C MLlib Spark run-time

Roadmap Current work Future work Impressive acceleration of MF with GPUs on one machine GPU acceleration techniques with model and data parallelism Illustrated applicability of GPU acceleration to Spark/Mllib Performance evaluations on K40, K80 GPUs, Intel and Power Future work GPU acceleration of other ML algorithms in Mllib or others Acceleration of algorithms for multiple GPUs on single and across machines, with and without RDMA across machines Performance evaluation on other hardware, including Other GPUs such as Nvidia Maxwell Forthcoming NVLink across GPUs within a single machine

Notices and Disclaimers Copyright © 2015 by International Business Machines Corporation (IBM). No part of this document may be reproduced or transmitted in any form without written permission from IBM. U.S. Government Users Restricted Rights - Use, duplication or disclosure restricted by GSA ADP Schedule Contract with IBM. Information in these presentations (including information relating to products that have not yet been announced by IBM) has been reviewed for accuracy as of the date of initial publication and could include unintentional technical or typographical errors. IBM shall have no responsibility to update this information. THIS document is distributed "AS IS" without any warranty, either express or implied. In no event shall IBM be liable for any damage arising from the use of this information, including but not limited to, loss of data, business interruption, loss of profit or loss of opportunity. IBM products and services are warranted according to the terms and conditions of the agreements under which they are provided. Any statements regarding IBM's future direction, intent or product plans are subject to change or withdrawal without notice. Performance data contained herein was generally obtained in a controlled, isolated environments. Customer examples are presented as illustrations of how those customers have used IBM products and the results they may have achieved. Actual performance, cost, savings or other results in other operating environments may vary. References in this document to IBM products, programs, or services does not imply that IBM intends to make such products, programs or services available in all countries in which IBM operates or does business. Workshops, sessions and associated materials may have been prepared by independent session speakers, and do not necessarily reflect the views of IBM. All materials and discussions are provided for informational purposes only, and are neither intended to, nor shall constitute legal or other guidance or advice to any individual participant or their specific situation. It is the customer’s responsibility to insure its own compliance with legal requirements and to obtain advice of competent legal counsel as to the identification and interpretation of any relevant laws and regulatory requirements that may affect the customer’s business and any actions the customer may need to take to comply with such laws. IBM does not provide legal advice or represent or warrant that its services or products will ensure that the customer is in compliance with any law.

Notices and Disclaimers (con’t) Information concerning non-IBM products was obtained from the suppliers of those products, their published announcements or other publicly available sources. IBM has not tested those products in connection with this publication and cannot confirm the accuracy of performance, compatibility or any other claims related to non-IBM products. Questions on the capabilities of non-IBM products should be addressed to the suppliers of those products. IBM does not warrant the quality of any third-party products, or the ability of any such third-party products to interoperate with IBM’s products. IBM expressly disclaims all warranties, expressed or implied, including but not limited to, the implied warranties of merchantability and fitness for a particular purpose. The provision of the information contained herein is not intended to, and does not, grant any right or license under any IBM patents, copyrights, trademarks or other intellectual property right. IBM, the IBM logo, ibm.com, Aspera®, Bluemix, Blueworks Live, CICS, Clearcase, Cognos®, DOORS®, Emptoris®, Enterprise Document Management System™, FASP®, FileNet®, Global Business Services ®, Global Technology Services ®, IBM ExperienceOne™, IBM SmartCloud®, IBM Social Business®, Information on Demand, ILOG, Maximo®, MQIntegrator®, MQSeries®, Netcool®, OMEGAMON, OpenPower, PureAnalytics™, PureApplication®, pureCluster™, PureCoverage®, PureData®, PureExperience®, PureFlex®, pureQuery®, pureScale®, PureSystems®, QRadar®, Rational®, Rhapsody®, Smarter Commerce®, SoDA, SPSS, Sterling Commerce®, StoredIQ, Tealeaf®, Tivoli®, Trusteer®, Unica®, urban{code}®, Watson, WebSphere®, Worklight®, X-Force® and System z® Z/OS, are trademarks of International Business Machines Corporation, registered in many jurisdictions worldwide. Other product and service names might be trademarks of IBM or other companies. A current list of IBM trademarks is available on the Web at "Copyright and trademark information" at: www.ibm.com/legal/copytrade.shtml.

Thank You