ECE 599/692 – Deep Learning Lecture 4 – CNN: Practical Issues

Slides:

Advertisements

Similar presentations

Neural Networks and Kernel Methods

Advertisements

Lecture 5: CNN: Regularization

Lecture 4: CNN: Optimization Algorithms

ARTIFICIAL NEURAL NETWORKS. Overview EdGeneral concepts Areej:Learning and Training Wesley:Limitations and optimization of ANNs Cora:Applications and.

CS 189 Brian Chu Slides at: brianchu.com/ml/

Fundamentals of Artificial Neural Networks Chapter 7 in amlbook.com.

Dropout as a Bayesian Approximation

Convolutional Neural Network

Deep Learning Methods For Automated Discourse CIS 700-7

Outilne Temporal feature construction

Data Mining, Neural Network and Genetic Programming

Computer Science and Engineering, Seoul National University

The Problem: Classification

Many slides and slide ideas thanks to Marc'Aurelio Ranzato and Michael Nielson.

COMP24111: Machine Learning and Optimisation

Applications of Deep Learning and how to get started with implementation of deep learning Presentation By : Manaswi Advisor : Dr.Chinmay.

Machine Learning I & II.

Inception and Residual Architecture in Deep Convolutional Networks

Training Techniques for Deep Neural Networks

Machine Learning: The Connectionist

Machine Learning – Regression David Fenyő

CSC 578 Neural Networks and Deep Learning

ECE 599/692 – Deep Learning Lecture 6 – CNN: The Variants

Deep Learning Convoluted Neural Networks Part 2 11/13/

Fully Convolutional Networks for Semantic Segmentation

Bird-species Recognition Using Convolutional Neural Network

CSC 578 Neural Networks and Deep Learning

Face Recognition with Deep Learning Method

SBNet: Sparse Blocks Network for Fast Inference

CS 4501: Introduction to Computer Vision Training Neural Networks II

ECE 599/692 – Deep Learning Lecture 1 - Introduction

ECE 599/692 – Deep Learning Lecture 9 – Autoencoder (AE)

Tips for Training Deep Network

ECE 599/692 – Deep Learning Lecture 5 – CNN: The Representative Power

Hairong Qi, Gonzalez Family Professor

Hairong Qi, Gonzalez Family Professor

Chap. 7 Regularization for Deep Learning (7.8~7.12 )

Smart Robots, Drones, IoT

ECE599/692 - Deep Learning Lecture 14 – Recurrent Neural Network (RNN)

Classiﬁcation of highly unbalanced data using deep learning techniques

Hairong Qi, Gonzalez Family Professor

Deep Learning for Non-Linear Control

TGS Salt Identification Challenge

Machine Learning – Neural Networks David Fenyő

Neural Networks II Chen Gao Virginia Tech ECE-5424G / CS-5824

Softmax Classifier.

ECE – Pattern Recognition Lecture 16: NN – Back Propagation

实习生汇报 ——北邮张安迪.

Image Classification & Training of Neural Networks

Mihir Patel and Nikhil Sardana

Hairong Qi, Gonzalez Family Professor

Neural Networks II Chen Gao Virginia Tech ECE-5424G / CS-5824

Hairong Qi, Gonzalez Family Professor

Introduction to Neural Networks

Natalie Lang Tomer Malach

Batch Normalization.

VERY DEEP CONVOLUTIONAL NETWORKS FOR LARGE-SCALE IMAGE RECOGNITION

Hairong Qi, Gonzalez Family Professor

ECE – Pattern Recognition Lecture 10 – Nonparametric Density Estimation – k-nearest-neighbor (kNN) Hairong Qi, Gonzalez Family Professor Electrical.

Debasis Bhattacharya, JD, DBA University of Hawaii Maui College

CSC 578 Neural Networks and Deep Learning

CSC 578 Neural Networks and Deep Learning

Hairong Qi, Gonzalez Family Professor

ECE – Pattern Recognition Lecture 8 – Performance Evaluation

ECE – Pattern Recognition Lecture 4 – Parametric Estimation

Hairong Qi, Gonzalez Family Professor

ECE – Pattern Recognition Lecture 14 – Gradient Descent

Principles of Back-Propagation

Overall Introduction for the Lecture

ECE – Pattern Recognition Midterm Review

Presentation transcript:

ECE 599/692 – Deep Learning Lecture 4 – CNN: Practical Issues Hairong Qi, Gonzalez Family Professor Electrical Engineering and Computer Science University of Tennessee, Knoxville http://www.eecs.utk.edu/faculty/qi Email: hqi@utk.edu AICIP stands for Advanced Imaging and Collaborative Information Processing

Outline Lecture 3: Core ideas of CNN Lecture 4: Practical issues Receptive field Pooling Shared weight Derivation of BP in CNN Lecture 4: Practical issues Normalized input and initialization of hyperparameters Cross validation Momentum Learning rate Activation functions Pooling strategies Regularization Lecture 5: Variants of CNN From LeNet to AlexNet to GoogleNet to VGG to ResNet Lecture 6: Implementation Lecture 7: Applications of CNN – Binary hashing Lecture 8: Applications of CNN – Person re-identification

Cost functions Least square quadratic Sigmoid and cross-entropy cost The learning slowdown problem Sigmoid and cross-entropy cost How does it solve the slowdown problem? Softmax and log-likelihood cost

Activation functions Sigmoid Softmax Tanh ReLU LeakyReLU

Regularization methods The problem of overfitting Weight decay L2 normalization L1 normalization Dropout Artificial expansion of the training data

Weight initialization Gaussian distribution with different std

How to choose hyper-parameters? The learning rate h Early stopping Learning rate schedules Regularization parameter l Mini-batch size m Grid search and the automated technique

Learning rates Always use smaller rates From [Duda&Hart:2001]

Momentum Taking into account of previous changes From [Duda&Hart:2001]