Progress Report 2016/12/28.

Slides:



Advertisements
Similar presentations
Cloud Computing: Theirs, Mine and Ours Belinda G. Watkins, VP EIS - Network Computing FedEx Services March 11, 2011.
Advertisements

Analysis of : Operator Scheduling in a Data Stream Manager CS561 – Advanced Database Systems By Eric Bloom.
Resource Management §A resource can be a logical, such as a shared file, or physical, such as a CPU (a node of the distributed system). One of the functions.
NDN in Local Area Networks Junxiao Shi The University of Arizona
Priority Research Direction (I/O Models, Abstractions and Software) Key challenges What will you do to address the challenges? – Develop newer I/O models.
Distributed Process Scheduling Summery Distributed Process Scheduling Summery BY:-Yonatan Negash.
U NIVERSITY OF M ASSACHUSETTS, A MHERST Department of Computer Science Virtualization in Data Centers Prashant Shenoy
1© Copyright 2015 EMC Corporation. All rights reserved. SDN INTELLIGENT NETWORKING IMPLICATIONS FOR END-TO-END INTERNETWORKING Simone Mangiante Senior.
HOL9396: Oracle Event Processing 12c
Research Directions for the Internet of Things Supervised by: Dr. Nouh Sabry Presented by: Ahmed Mohamed Sayed.
CS Dept, City Univ.1 Research Issues in Wireless Sensor Networks Prof. Xiaohua Jia Dept. of Computer Science City University of Hong Kong.
Data Center Infrastructure
Tufts Wireless Laboratory School Of Engineering Tufts University “Network QoS Management in Cyber-Physical Systems” Nicole Ng 9/16/20151 by Feng Xia, Longhua.
Introduction and Overview Questions answered in this lecture: What is an operating system? How have operating systems evolved? Why study operating systems?
COLLABORATIVE SPECTRUM MANAGEMENT FOR RELIABILITY AND SCALABILITY Heather Zheng Dept. of Computer Science University of California, Santa Barbara.
Integrating Fine-Grained Application Adaptation with Global Adaptation for Saving Energy Vibhore Vardhan, Daniel G. Sachs, Wanghong Yuan, Albert F. Harris,
Adaptive software in cloud computing Marin Litoiu York University Canada.
Wireless Networks Breakout Session Summary September 21, 2012.
Challenges towards Elastic Power Management in Internet Data Center.
Big Data Analytics Large-Scale Data Management Big Data Analytics Data Science and Analytics How to manage very large amounts of data and extract value.
IoT Primer Stephen Bates | Energy Huntsville: Tues 15 Dec
Internet of Things. IoT Novel paradigm – Rapidly gaining ground in the wireless scenario Basic idea – Pervasive presence around us a variety of things.
Internet of Things in Industries
Web Technologies Lecture 13 Introduction to cloud computing.
Capacity Planning in a Virtual Environment Chris Chesley, Sr. Systems Engineer
Internet of Things. Creating Our Future Together.
Efficient Opportunistic Sensing using Mobile Collaborative Platform MOSDEN.
TWOJA CYFROWA PRZYSZŁOŚĆ. JUŻ DZISIAJ. Christoph F. Strnadl CTO Central & Eastern Europe 11 May 2016.
Embedded Database Benchmark Team CodeBlooded. Internet of Things “As the number of interconnected platforms continues to multiply, vendors and customers.
Green IT: Sustainability A History Computing Research: Roles and Opportunities for Information Technology in Meeting Sustainability Challenges.
ScotGRID is the Scottish prototype Tier 2 Centre for LHCb and ATLAS computing resources. It uses a novel distributed architecture and cutting-edge technology,
INTRODUCTION TO WIRELESS SENSOR NETWORKS
Connected Infrastructure
TensorFlow– A system for large-scale machine learning
OPERATING SYSTEMS CS 3502 Fall 2017
Resource Management IB Computer Science.
Architecture and Algorithms for an IEEE 802
Data Center Infrastructure
Threads vs. Events SEDA – An Event Model 5204 – Operating Systems.
Netscape Application Server
Introduction to Load Balancing:
Breakout Session 3 Alex, Mirco, Vojtech, Juraj, Christoph
Interactive Machine Learning with a GPU-Accelerated Toolkit
Engineers Gate Founded in early 2014 by team led by Glenn Dubin
Work-in-Progress: Wireless Network Reconfiguration for Control Systems
Data Dissemination and Management (2) Lecture 10
Connected Infrastructure
Introduction to Operating System (OS)
Algorithms for Big Data Delivery over the Internet of Things
Software Architecture in Practice
به نام خدا Big Data and a New Look at Communication Networks Babak Khalaj Sharif University of Technology Department of Electrical Engineering.
April 30th – Scheduling / parallel
Department of Computer Science University of California, Santa Barbara
RM3G: Next Generation Recovery Manager
Speaker: Jin-Wei Lin Advisor: Dr. Ho-Ting Wu
Smart Learning concepts to enhance SMART Universities in Africa
Horizontally Partitioned Hybrid Main Memory with PCM
(A Research Proposal for Optimizing DBMS on CMP)
Declarative Transfer Learning from Deep CNNs at Scale
Subject Name: Operating System Concepts Subject Number:
Nenad Stefanovic and Danijela Milosevic
Operating System Introduction.
TensorFlow: A System for Large-Scale Machine Learning
Department of Computer Science University of California, Santa Barbara
Kostas Kolomvatsos, Christos Anagnostopoulos
Rohan Yadav and Charles Yuan (rohany) (chenhuiy)
Utilizing the Network Edge
EdgeWise: A Better Stream Processing Engine for the Edge
ISO Open system Requirements and IT Needs
Data Dissemination and Management (2) Lecture 10
Presentation transcript:

Progress Report 2016/12/28

ICPADS’16 Keynote Database Meets Deep Learning: Challenges and Opportunities IoT: Towards a Connected Era—Research Direction and Social Impacts On Sensorless Sensing for the Internet of Everything Theory and Optimization of Multi-core Memory Performance

Database Meets Deep Learning: Challenges and Opportunities Deep Learning(DL) is difficult to: Understand (its effectiveness) Tune the hyper-parameters Optimize (the training speed and memory usage) Apply database techniques to DL For efficiency optimization For memory optimization Why? Database operations are “predictable”.

Efficiency Optimization To improve the training speed of DL on a single device(GPU or CPU device) All operations of one iteration create a dataflow graph. Existing systems either do static(Theano and TensorFlow) or dynamic(MxNet) dependency analysis to parallelize operations without data dependencies. Possible improvements: When there are limited resources, e.g. executors(CUDA streams), there could be multiple ways of placing the operations onto the executors. Runtime optimization by 1) collecting the cost(i.e. FLOPS) of each operation and hardware statistics 2) estimating the total cost of all plans.

Memory Optimization Optimize memory footprint for DL Memory is used by Parameter values Parameter gradients Hidden layer values Hidden layer gradients GPUs and other new hardware have less memory than CPUs. Memory pool and garbage collection Release memory if it is not used in the rest of one iteration. Frequent malloc/free is costly for GPUs. Memory pool could help. Swap data between GPU and CPU memory Statically done by users. Dynamically done by system using the paging strategies? Drop some hidden layer values and redo the computation to recover them. Statically analyze the dataflow to decide which one to drop. Dynamically determine which one to drop and recover it when necessary or in advance? Need to consider which layer to recovery and estimate the best time to recover.

Summary Deep learning is effective but difficult to optimize. DB optimization techniques could be adapted for DL systems. DL model can be applied to DB applications. Find optimal query plan. Apache SINGA: A distributed platform for DL. Optimizing speed, memory for single node. Optimizing communication/consistency for distributed training. DLaSS easy deployment and visualization.

IoT: Towards a Connected Era—Research Direction and Social Impacts Network society = Ubiquitous society From vertical integration to horizontal integration. IoT can help: Elderly care, traffic for elderly, local industry, agriculture revolution, smart city, … Toward a better society with open data.

Core Technology Big data processing Edge Computing Pushing the frontier of computing applications, data, and services away from centralized nodes to the logical extremes of a network. Enables analytics and knowledge generation to occur at the source of the data. Edge access network construction

Relay-by-Smartphone Delivering Messages Using only Wifi without Celler Coverage 1) https://www.youtube.com/watch?v=JbxKPrPF6JQ 2) https://www.youtube.com/watch?v=dVC4RLWMBng 3) https://www.youtube.com/watch?v=KXvphDBr5kE 4) https://www.youtube.com/watch?v=X7usb2ldHBs

On Sensorless Sensing for the Internet of Everything The barriers for long-term and large scale sensor networks and IoT: 1) Hardware cost and its maintenance 2) Energy Possible solutions: 1) Crowd-sourcing and participatory sensing. 2) backscatter to solve the energy problem.

Sensorless Sensing Applications: Benefits: Intrusion detection Healthcare monitoring Human-Computer Interaction Benefits: Wireless sensing without wire. Contactless sensing without wearable sensors. Sensorless sensing without dedicated sensors.

Projects CitySee 2011-ongoing 签信通 2011-ongoing GreenOrbs (绿野千传) 2009-ongoing http://www.greenorbs.org/ Trustworthy RFID (签若磐石) 2007-ongoing OceanSense, Sensor Network for Sea Monitoring 2007-2009 http://www.cse.ust.hk/~liu/Ocean/index.html SASA, Coal Mine Monitoring with Sensors 2005-2008

Theory and Optimization of Multi-core Memory Performance Memory is multi-layered, hierarchical and shared. Caching is dynamic sharing of fast memory. Parallel programs / workloads Large cache Run-time mediation between demand and supply Data-ranking / priority allocation Similar to the role of an OS scheduler for a CPU But cache intervenes at every load/store “The science of multi-core cache sharing”

Locality

Rochester Elastic Cache Utility (RECU) Trade off between Efficiency and Fairness. Provides an algorithm to choose a cache partitioning for a set of programs in order to optimize the predicted cache performance. With some fairness concessions based on a baseline of each program having an equal partition. Strict baseline.

Miss Ratio Elasticity and Cache Space Elasticity Elastic miss ratio baseline (EMB) The program's predicted miss ratio may increase by up to some percentage compared to what it would be with the strict baseline. Elastic cache space baseline (ECB) The program may yield at most a given percentage of its cache space to peer programs.

Evaluation Results

Evaluation Results(Cont.)

Other Works

Summary Algorithm + Data Structure + Locality = Efficient Programs. Locality science Formal, universal relations between locality metrics. Universal properties Monotonicity, concavity, composition invariance and symmetry Locality optimization Minimize data movement between Fast and slow memory Small and large memory Local and remote memory Optimal program and cache organization