Petuum: An Iterative-Convergent Distributed Machine Learning Framework
Su Yuxin, Jan 20, 2014
Outline: Introduction, Implementation, Questions, Demo
Introduction to Petuum
Bulk Synchronous Parallel
Asynchronous: parameters are read / updated at any time
Stale Synchronous Parallel
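To make the contrast with BSP and fully asynchronous execution concrete, here is a minimal sketch of the SSP read rule, written as my own illustration rather than Petuum code: a worker at clock c may serve a read from its local cache only if the cached copy already reflects all updates up to clock c - staleness (BSP is the special case staleness = 0).

    #include <cstdint>

    // Illustration of the SSP read rule (not Petuum code): a cached row is
    // fresh enough if it already contains every update made before
    // worker_clock - staleness; otherwise the worker must refresh it.
    bool CachedRowIsFreshEnough(int32_t cached_row_clock,
                                int32_t worker_clock,
                                int32_t staleness) {
      return cached_row_clock >= worker_clock - staleness;
    }

    int main() {
      // staleness = 0 degenerates to BSP; an unbounded staleness behaves
      // like fully asynchronous reads.
      const int32_t worker_clock = 10, staleness = 3;
      return CachedRowIsFreshEnough(/*cached_row_clock=*/8,
                                    worker_clock, staleness) ? 0 : 1;
    }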
Convergence
Programming interface: read(table, row, col), inc(table, row, col, value), iteration()
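As a usage sketch built only on the three calls listed above: a hypothetical single-worker SGD loop over a 4x4 parameter table. The in-process stand-ins for read/inc/iteration, the table id, and the placeholder gradient are all my assumptions, not part of the real client library.

    #include <cstdio>
    #include <unordered_map>

    // Toy in-process stand-ins for the three calls on the slide; a real
    // Petuum program would link against the client library instead.
    static std::unordered_map<long long, double> g_table;  // (row, col) -> value
    static long long Key(int row, int col) { return 1000LL * row + col; }

    double read(int table, int row, int col) { (void)table; return g_table[Key(row, col)]; }
    void inc(int table, int row, int col, double value) { (void)table; g_table[Key(row, col)] += value; }
    void iteration() { /* end of one clock: buffered updates become visible to other workers */ }

    int main() {
      const int kTable = 0, kRows = 4, kCols = 4, kIters = 10;
      const double kLearningRate = 0.1;
      for (int it = 0; it < kIters; ++it) {
        for (int r = 0; r < kRows; ++r) {
          for (int c = 0; c < kCols; ++c) {
            double w = read(kTable, r, c);
            double grad = w - 1.0;                   // placeholder gradient
            inc(kTable, r, c, -kLearningRate * grad);
          }
        }
        iteration();                                 // advance this worker's clock
      }
      printf("w[0][0] = %f\n", read(kTable, 0, 0));
      return 0;
    }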
Implementation
Overview: Logical View
Overview: Physical View
Main Components
Table
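As shared context for the next slides, here is a simplified and assumed layout for a client-side cached table (row id mapped to its values plus the clock of that copy); the real Petuum table classes carry more state (locks, operation logs, and so on).

    #include <cstdint>
    #include <unordered_map>
    #include <vector>

    // Simplified, assumed layout of a client-side cached table (not the
    // actual Petuum classes).
    struct CachedRow {
      std::vector<double> values;   // one value per column
      int32_t clock = -1;           // server clock this copy reflects
    };

    struct ClientTable {
      int32_t table_id = 0;
      int32_t staleness = 0;                        // SSP bound for this table
      std::unordered_map<int32_t, CachedRow> rows;  // row id -> cached copy
    };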
ConsistencyController::DoGet()
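A hedged reconstruction of what the client-side read path might do under SSP, reusing the CachedRow / ClientTable types sketched above; FetchRowFromServer is a placeholder for an RPC that blocks until the server can return a copy at or beyond the required clock. This is my illustration of the rule, not the Petuum source.

    // Placeholder for the blocking RPC; a real client would contact the
    // server and wait for a reply whose clock is >= min_clock.
    CachedRow FetchRowFromServer(int32_t table_id, int32_t row_id, int32_t min_clock) {
      (void)table_id; (void)row_id;
      CachedRow reply;
      reply.values.assign(16, 0.0);   // stand-in payload
      reply.clock = min_clock;
      return reply;
    }

    double DoGet(ClientTable& table, int32_t worker_clock,
                 int32_t row_id, int32_t col_id) {
      const int32_t min_clock = worker_clock - table.staleness;
      auto it = table.rows.find(row_id);
      if (it == table.rows.end() || it->second.clock < min_clock) {
        // Cache miss, or the cached copy is too stale for the SSP bound:
        // fetch a fresher copy from the server and wait for it.
        table.rows[row_id] = FetchRowFromServer(table.table_id, row_id, min_clock);
        it = table.rows.find(row_id);
      }
      return it->second.values[col_id];
    }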
ConsistencyController::iterate()
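A matching reconstruction of the end-of-iteration path, again my own sketch rather than the Petuum source: flush the inc() operations buffered since the last clock, advance the worker's clock, and tell the server. OpLogEntry and the two Send helpers are assumed placeholders.

    #include <cstdint>
    #include <vector>

    struct OpLogEntry { int32_t table, row, col; double delta; };

    // Placeholders: real code would serialize these and ship them over the network.
    void SendOpLogToServer(const std::vector<OpLogEntry>& oplog) { (void)oplog; }
    void SendClockToServer(int32_t new_clock) { (void)new_clock; }

    struct WorkerState {
      int32_t clock = 0;
      std::vector<OpLogEntry> pending_ops;  // inc() calls buffered since the last clock
    };

    void Iterate(WorkerState& w) {
      SendOpLogToServer(w.pending_ops);     // push buffered updates
      w.pending_ops.clear();
      ++w.clock;                            // advance this worker's clock
      SendClockToServer(w.clock);           // let the server track worker progress
    }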
Server::GetRow()
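On the server side, a GetRow request whose required clock the server has not yet reached plausibly has to be held back until enough clock messages arrive. The single-process sketch below illustrates that idea with a condition variable; it is not the actual Petuum server code.

    #include <condition_variable>
    #include <cstdint>
    #include <mutex>
    #include <unordered_map>
    #include <vector>

    // Illustration only: GetRow(row_id, min_clock) replies once the slowest
    // worker has clocked past min_clock, so the returned row is fresh enough
    // for the requester's SSP bound.
    class ToyServer {
     public:
      std::vector<double> GetRow(int32_t row_id, int32_t min_clock) {
        std::unique_lock<std::mutex> lock(mu_);
        cv_.wait(lock, [&] { return min_worker_clock_ >= min_clock; });
        return rows_[row_id];
      }

      void ApplyInc(int32_t row_id, int32_t col, double delta) {
        std::lock_guard<std::mutex> lock(mu_);
        auto& row = rows_[row_id];
        if (row.size() <= static_cast<size_t>(col)) row.resize(col + 1, 0.0);
        row[col] += delta;
      }

      void Clock(int32_t min_worker_clock) {
        std::lock_guard<std::mutex> lock(mu_);
        min_worker_clock_ = min_worker_clock;
        cv_.notify_all();  // wake readers waiting for this clock
      }

     private:
      std::mutex mu_;
      std::condition_variable cv_;
      int32_t min_worker_clock_ = 0;
      std::unordered_map<int32_t, std::vector<double>> rows_;
    };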
Least-Recently-Used (LRU) Strategy
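Since the client cannot keep every row resident, old rows are evicted from the cache. The block below is a generic LRU cache of rows illustrating the policy named on the slide, not Petuum's actual cache implementation.

    #include <cstdint>
    #include <list>
    #include <unordered_map>
    #include <vector>

    // Generic LRU cache of rows: the least recently touched row is evicted
    // when capacity is exceeded.
    class LruRowCache {
     public:
      explicit LruRowCache(size_t capacity) : capacity_(capacity) {}

      // Returns nullptr on miss; on hit, marks the row as most recently used.
      const std::vector<double>* Get(int32_t row_id) {
        auto it = index_.find(row_id);
        if (it == index_.end()) return nullptr;
        order_.splice(order_.begin(), order_, it->second);
        return &it->second->second;
      }

      void Put(int32_t row_id, std::vector<double> row) {
        auto it = index_.find(row_id);
        if (it != index_.end()) {
          it->second->second = std::move(row);
          order_.splice(order_.begin(), order_, it->second);
          return;
        }
        if (index_.size() >= capacity_) {      // evict least recently used
          index_.erase(order_.back().first);
          order_.pop_back();
        }
        order_.emplace_front(row_id, std::move(row));
        index_[row_id] = order_.begin();
      }

     private:
      size_t capacity_;
      std::list<std::pair<int32_t, std::vector<double>>> order_;  // front = newest
      std::unordered_map<int32_t, decltype(order_)::iterator> index_;
    };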
Questions
Is Lock-Free Possible? Real-time data exchange? next …
Is Auto-Rescheduling Possible? A sub-centralized server could reduce communication cost.
Is Auto-Partitioning Possible? Run ML algorithms as if on a single thread; a solution for all ML algorithms.
In-Memory or In-Storage? Data size can exceed memory capacity, so memory should act as a cache for disk storage. Solutions for disk-backed storage: Hadoop, Spark, ….
A New Scheme to Reduce the Upper Bound?
STRADS Scheduler: variable correlations, auto-parallelization, dynamic prioritization (monitor each variable's contribution to the objective function), and load balancing across tasks
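For the dynamic-prioritization point, a hedged sketch of the idea (my illustration, not STRADS code): remember how much each variable's last update moved the objective, and sample the next variables to update with probability roughly proportional to that contribution.

    #include <cmath>
    #include <cstddef>
    #include <random>
    #include <vector>

    // Priority-based variable scheduling: variables whose recent updates
    // moved the objective more are scheduled more often.
    std::vector<size_t> PickVariables(const std::vector<double>& recent_delta_objective,
                                      size_t how_many, std::mt19937& rng) {
      std::vector<double> weights(recent_delta_objective.size());
      for (size_t i = 0; i < weights.size(); ++i)
        weights[i] = std::abs(recent_delta_objective[i]) + 1e-6;  // keep every variable schedulable
      std::discrete_distribution<size_t> pick(weights.begin(), weights.end());
      std::vector<size_t> chosen(how_many);
      for (size_t k = 0; k < how_many; ++k) chosen[k] = pick(rng);
      return chosen;
    }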
Demo: switch to my laptop …