Lecture 24: Model Hub.

Slides:

Advertisements

Similar presentations

Query Optimization of Frequent Itemset Mining on Multiple Databases Mining on Multiple Databases David Fuhry Department of Computer Science Kent State.

Advertisements

ELPUB 2006 June Bansko Bulgaria1 Automated Building of OAI Compliant Repository from Legacy Collection Kurt Maly Department of Computer.

Workflow Exchange and Archival: The KSW File and the Kepler Object Manager Shawn Bowers (For Chad Berkley & Matt Jones) University of California, Davis.

Video retrieval using inference network A.Graves, M. Lalmas In Sig IR 02.

Integrated Scientific Workflow Management for the Emulab Network Testbed Eric Eide, Leigh Stoller, Tim Stack, Juliana Freire, and Jay Lepreau and Jay Lepreau.

Navigating and Browsing 3D Models in 3DLIB Hesham Anan, Kurt Maly, Mohammad Zubair Computer Science Dept. Old Dominion University, Norfolk, VA, (anan,

Database System Concepts and Architecture Lecture # 3 22 June 2012 National University of Computer and Emerging Sciences.

Data Exchange Tools (DExT) DExT PROJECTAN OPEN EXCHANGE FORMAT FOR DATA enables long-term preservation and re-use of metadata,

Issues with Data Mining

1 Yolanda Gil Information Sciences InstituteJanuary 10, 2010 Requirements for caBIG Infrastructure to Support Semantic Workflows Yolanda.

Richard Socher Cliff Chiung-Yu Lin Andrew Y. Ng Christopher D. Manning

Software Engineering CS3003 Lecture 3 Software maintenance and evolution.

Division of IT Convergence Engineering Towards Unified Management A Common Approach for Telecommunication and Enterprise Usage Sung-Su Kim, Jae Yoon Chung,

Database System Concepts and Architecture

Active Monitoring in GRID environments using Mobile Agent technology Orazio Tomarchio Andrea Calvagna Dipartimento di Ingegneria Informatica e delle Telecomunicazioni.

An Integration Framework for Sensor Networks and Data Stream Management Systems.

DeepFont: Large-Scale Real-World Font Recognition from Images

The Data Grid: Towards an Architecture for the Distributed Management and Analysis of Large Scientific Dataset Caitlin Minteer & Kelly Clynes.

A Framework for Elastic Execution of Existing MPI Programs Aarthi Raveendran Tekin Bicer Gagan Agrawal 1.

What are the main differences and commonalities between the IS and DA systems? How information is transferred between tasks: (i) IS it may be often achieved.

A Framework for Elastic Execution of Existing MPI Programs Aarthi Raveendran Graduate Student Department Of CSE 1.

1 Schema Registries Steven Hughes, Lou Reich, Dan Crichton NASA 21 October 2015.

Clinical Quality Language (CQL) Bryn Rhodes Chris Moesel Mark Kramer.

1 CloudVS: Enabling Version Control for Virtual Machines in an Open- Source Cloud under Commodity Settings Chung-Pan Tang, Tsz-Yeung Wong, Patrick P. C.

Page 1 Inferring Relevant Social Networks from Interpersonal Communication Munmun De Choudhury, Winter Mason, Jake Hofman and Duncan Watts WWW ’10 Summarized.

1 GRID Based Federated Digital Library K. Maly, M. Zubair, V. Chilukamarri, and P. Kothari Department of Computer Science Old Dominion University February,

Event retrieval in large video collections with circulant temporal encoding CVPR 2013 Oral.

Issues in Ontology-based Information integration By Zhan Cui, Dean Jones and Paul O’Brien.

Software Reuse Course: # The Johns-Hopkins University Montgomery County Campus Fall 2000 Session 4 Lecture # 3 - September 28, 2004.

Feb 21-25, 2005ICM 2005 Mumbai1 Converting Existing Corpus to an OAI Compliant Repository J. Tang, K. Maly, and M. Zubair Department of Computer Science.

Information Design Trends Unit 4 : Sources and Standards Lecture 1: Content Management Part 1.

Instance Discovery and Schema Matching With Applications to Biological Deep Web Data Integration Tantan Liu, Fan Wang, Gagan Agrawal {liut, wangfa,

Gorilla: A Fast, Scalable, In-Memory Time Series Database

CSC314 Day 16 Transaction processing Concurrency Control 1.

DOWeR Detecting Outliers in Web Service Requests Master’s Presentation of Christian Blass.

Databases and DBMSs Todd S. Bacastow January 2005.

DeepFont: Identify Your Font from An Image

Implementing Boosting and Convolutional Neural Networks For Particle Identification (PID) Khalid Teli .

PDAP Query Language International Planetary Data Alliance

Copyright © 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 2 Database System Concepts and Architecture.

The Google File System Sanjay Ghemawat, Howard Gobioff and Shun-Tak Leung Google Presented by Jiamin Huang EECS 582 – W16.

What’s New in Colectica 5.3 Part 1

Master’s Thesis defense Ming Du Advisor: Dr. Yi Shang

Chapter 2 Database Environment Pearson Education © 2009.

Lecture 23: Feature Selection

Machine Learning Platform Life-Cycle Management

11. The future of SDMX Introducing the SDMX Roadmap 2020

An Adaptive Middleware for Supporting Time-Critical Event Response

Chap. 7 Regularization for Deep Learning (7.8~7.12 )

Word Embedding Word2Vec.

Internet Protocols IP: Internet Protocol

Outline Background Motivation Proposed Model Experimental Results

ML – Lecture 3B Deep NN.

Deep Robust Unsupervised Multi-Modal Network

Ying Dai Faculty of software and information science,

Ying Dai Faculty of software and information science,

Tuning CNN: Tips & Tricks

Metadata The metadata contains

Ying Dai Faculty of software and information science,

Database System Concepts and Architecture

Word embeddings (continued)

Towards Unified Management

A platform for the Complete Machine Learning Lifecycle

Natural Language Processing (NLP) Systems Joseph E. Gonzalez

Chapter 2 Database Environment Pearson Education © 2009.

Oracle SQL Developer Data Modeler

Multiple DAGs Learning with Non-negative Matrix Factorization

Bug Localization with Combination of Deep Learning and Information Retrieval A. N. Lam et al. International Conference on Program Comprehension 2017.

Report 7 Brandon Silva.

Presentation transcript:

Lecture 24: Model Hub

Announcements No class on Thursday: Push for the projects. Presentations next week: 15+3 minutes per group Should be like a conference presentation Motivation, problem + background, approach, results Tuesday 5/1: Groups 1 - 4 Thursday 5/3: Groups 5 - 8

Deep learning everywhere Everyone is using DL models Multiple users/models even for the same task Train, evaluate, and serve.

Beyond training Current tools focus on model building, training, and evaluation of single model How do we reason about model variations? Different architectures, features Questions/Tasks: How to understand and compare models? How to adjust models? Hyperparameter tuning Model versioning: reuse weights for warm start Parameter archiving Model results

ModelHub ModelHub: Manage DL lifecycle artifacts. Model versioning: store and query models Model network adjustment and hyper parameter tuning Deep learning model sharing to exchange model repositories and enable publishing, sharing and reuse.

Representing DNNs Function relationships (fi, fi-1). A DNN is a DAG Layers or arithmetic operations can be functions

Version Control System A model version: Network definition Collection of weights Metadata: hyperparameters, accuracy, loss Other files: scripts, datasets Use the versioning ideas from Datahub

Model versioning operations Model exploration queries Model enumeration queries Remote Interaction

Query Facilities: Explorations Queries List Models and related lineages Obtain commit descriptions and parent versions Describe model Show extracted metadata from a model version Compare models diff over models: compare DNN models (architectures) Evaluate a model

Query Facilities: EnumerationQueries Select models from repo Slice models to get components Construct new models by mutating existing ones Try new configs on existing models Design a domain-specific language for ModelHub. Based on SQL.

DQL examples

DQL examples

DQL examples

DQL examples

Parameter Archival Storage Use a version graph Snapshots corresponds to model instances during training Nodes V correspond to model versions Question: How do you store all this data?

Challenges Similarity among fine-tuned models (e.g., similar parameters) Co-usage constraints Low-precision tolerance (DNNs tolerate small variances) Unbalanced access frequencies (not all models are great)

Float Data Use standard tricks: Still high-entropy of float numbers Lower the precision Used fixed point encoding Quantization Still high-entropy of float numbers Separate high-order and low-order bits Float matrix is stored in multiple chuncks:8 high-order bits first rest are segmented in one byte per chunk. Delta encoding between checkpointed snapshot in one model and latest snapshots across multiple model versions.

Matrix Storage and Graph Plan Same problems now as versioning control we saw before. Optimize for average snapshot recreation or storage cost

Model Evaluation Apply DNN on a data point to get the prediction For exploration: Decompress and evaluate a model Expensive! Segmented parameters: you know min – max for that range If prediction is the same then no need to access lower order bytes Think of stability

Model Evaluation

Progressive Model Evaluation

Evaluation

Evaluation

Conclusions

Reminder No class on Thursday: Push for the projects. Presentations next week: 15+3 minutes per group Should be like a conference presentation Motivation, problem + background, approach, results Tuesday 5/1: Groups 1 - 4 Thursday 5/3: Groups 5 - 8