ECE 599/692 – Deep Learning Lecture 4 – CNN: Practical Issues

Slides:



Advertisements
Similar presentations
Neural Networks and Kernel Methods
Advertisements

Lecture 5: CNN: Regularization
Lecture 4: CNN: Optimization Algorithms
ARTIFICIAL NEURAL NETWORKS. Overview EdGeneral concepts Areej:Learning and Training Wesley:Limitations and optimization of ANNs Cora:Applications and.
CS 189 Brian Chu Slides at: brianchu.com/ml/
Fundamentals of Artificial Neural Networks Chapter 7 in amlbook.com.
Dropout as a Bayesian Approximation
Convolutional Neural Network
Deep Learning Methods For Automated Discourse CIS 700-7
Outilne Temporal feature construction
Data Mining, Neural Network and Genetic Programming
Computer Science and Engineering, Seoul National University
The Problem: Classification
Many slides and slide ideas thanks to Marc'Aurelio Ranzato and Michael Nielson.
COMP24111: Machine Learning and Optimisation
Applications of Deep Learning and how to get started with implementation of deep learning Presentation By : Manaswi Advisor : Dr.Chinmay.
Machine Learning I & II.
Inception and Residual Architecture in Deep Convolutional Networks
Training Techniques for Deep Neural Networks
Machine Learning: The Connectionist
Machine Learning – Regression David Fenyő
CSC 578 Neural Networks and Deep Learning
ECE 599/692 – Deep Learning Lecture 6 – CNN: The Variants
Deep Learning Convoluted Neural Networks Part 2 11/13/
Fully Convolutional Networks for Semantic Segmentation
Bird-species Recognition Using Convolutional Neural Network
CSC 578 Neural Networks and Deep Learning
Face Recognition with Deep Learning Method
SBNet: Sparse Blocks Network for Fast Inference
CS 4501: Introduction to Computer Vision Training Neural Networks II
ECE 599/692 – Deep Learning Lecture 1 - Introduction
ECE 599/692 – Deep Learning Lecture 9 – Autoencoder (AE)
Tips for Training Deep Network
ECE 599/692 – Deep Learning Lecture 5 – CNN: The Representative Power
Hairong Qi, Gonzalez Family Professor
Hairong Qi, Gonzalez Family Professor
Chap. 7 Regularization for Deep Learning (7.8~7.12 )
Smart Robots, Drones, IoT
ECE599/692 - Deep Learning Lecture 14 – Recurrent Neural Network (RNN)
Classification of highly unbalanced data using deep learning techniques
Hairong Qi, Gonzalez Family Professor
Deep Learning for Non-Linear Control
TGS Salt Identification Challenge
Machine Learning – Neural Networks David Fenyő
Neural Networks II Chen Gao Virginia Tech ECE-5424G / CS-5824
Softmax Classifier.
ECE – Pattern Recognition Lecture 16: NN – Back Propagation
实习生汇报 ——北邮 张安迪.
Image Classification & Training of Neural Networks
Mihir Patel and Nikhil Sardana
Hairong Qi, Gonzalez Family Professor
Neural Networks II Chen Gao Virginia Tech ECE-5424G / CS-5824
Hairong Qi, Gonzalez Family Professor
Introduction to Neural Networks
Natalie Lang Tomer Malach
Batch Normalization.
VERY DEEP CONVOLUTIONAL NETWORKS FOR LARGE-SCALE IMAGE RECOGNITION
Hairong Qi, Gonzalez Family Professor
ECE – Pattern Recognition Lecture 10 – Nonparametric Density Estimation – k-nearest-neighbor (kNN) Hairong Qi, Gonzalez Family Professor Electrical.
Debasis Bhattacharya, JD, DBA University of Hawaii Maui College
CSC 578 Neural Networks and Deep Learning
CSC 578 Neural Networks and Deep Learning
Hairong Qi, Gonzalez Family Professor
ECE – Pattern Recognition Lecture 8 – Performance Evaluation
ECE – Pattern Recognition Lecture 4 – Parametric Estimation
Hairong Qi, Gonzalez Family Professor
ECE – Pattern Recognition Lecture 14 – Gradient Descent
Principles of Back-Propagation
Overall Introduction for the Lecture
ECE – Pattern Recognition Midterm Review
Presentation transcript:

ECE 599/692 – Deep Learning Lecture 4 – CNN: Practical Issues Hairong Qi, Gonzalez Family Professor Electrical Engineering and Computer Science University of Tennessee, Knoxville http://www.eecs.utk.edu/faculty/qi Email: hqi@utk.edu AICIP stands for Advanced Imaging and Collaborative Information Processing

Outline Lecture 3: Core ideas of CNN Lecture 4: Practical issues Receptive field Pooling Shared weight Derivation of BP in CNN Lecture 4: Practical issues Normalized input and initialization of hyperparameters Cross validation Momentum Learning rate Activation functions Pooling strategies Regularization Lecture 5: Variants of CNN From LeNet to AlexNet to GoogleNet to VGG to ResNet Lecture 6: Implementation Lecture 7: Applications of CNN – Binary hashing Lecture 8: Applications of CNN – Person re-identification

Cost functions Least square quadratic Sigmoid and cross-entropy cost The learning slowdown problem Sigmoid and cross-entropy cost How does it solve the slowdown problem? Softmax and log-likelihood cost

Activation functions Sigmoid Softmax Tanh ReLU LeakyReLU

Regularization methods The problem of overfitting Weight decay L2 normalization L1 normalization Dropout Artificial expansion of the training data

Weight initialization Gaussian distribution with different std

How to choose hyper-parameters? The learning rate h Early stopping Learning rate schedules Regularization parameter l Mini-batch size m Grid search and the automated technique

Learning rates Always use smaller rates From [Duda&Hart:2001]

Momentum Taking into account of previous changes From [Duda&Hart:2001]