The Problem: Classification

Slides:



Advertisements
Similar presentations
Lecture 6: Classification & Localization
Advertisements

Classification spotlights
ImageNet Classification with Deep Convolutional Neural Networks
Spatial Pyramid Pooling in Deep Convolutional
Hands-on course in deep neural networks for vision
Fully Convolutional Networks for Semantic Segmentation
Deep Convolutional Nets
Ross Girshick, Jeff Donahue, Trevor Darrell, Jitendra Malik Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation.
CS 188: Artificial Intelligence Learning II: Linear Classification and Neural Networks Instructors: Stuart Russell and Pat Virtue University of California,
ImageNet Classification with Deep Convolutional Neural Networks Presenter: Weicong Chen.
Logan Lebanoff Mentor: Haroon Idrees
Convolutional Neural Network
Lecture 4a: Imagenet: Classification with Localization
Spatial Localization and Detection
Deep Residual Learning for Image Recognition
Introduction to Convolutional Neural Networks
Lecture 3b: CNN: Advanced Layers
Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition arXiv: v4 [cs.CV(CVPR)] 23 Apr 2015 Kaiming He, Xiangyu Zhang, Shaoqing.
Convolutional Neural Networks
Cancer Metastases Classification in Histological Whole Slide Images
Deep Learning and Its Application to Signal and Image Processing and Analysis Class III - Fall 2016 Tammy Riklin Raviv, Electrical and Computer Engineering.
Recent developments in object detection
The Relationship between Deep Learning and Brain Function
Object Detection based on Segment Masks
Data Mining, Neural Network and Genetic Programming
Computer Science and Engineering, Seoul National University
Convolutional Neural Fabrics by Shreyas Saxena, Jakob Verbeek
Krishna Kumar Singh, Yong Jae Lee University of California, Davis
CS 2770: Computer Vision Neural Networks
CSCI 5922 Neural Networks and Deep Learning: Convolutional Nets For Image And Speech Processing Mike Mozer Department of Computer Science and Institute.
Lecture 24: Convolutional neural networks
Regularizing Face Verification Nets To Discrete-Valued Pain Regression
Inception and Residual Architecture in Deep Convolutional Networks
Hierarchical Deep Convolutional Neural Network
ECE 6504 Deep Learning for Perception
Training Techniques for Deep Neural Networks
Efficient Deep Model for Monocular Road Segmentation
Deep Residual Learning for Image Recognition
R-CNN region By Ilia Iofedov 11/11/2018 BGU, DNN course 2016.
Project Implementation for ITCS4122
ECE 599/692 – Deep Learning Lecture 6 – CNN: The Variants
Deep Learning Convoluted Neural Networks Part 2 11/13/
Layer-wise Performance Bottleneck Analysis of Deep Neural Networks
Bird-species Recognition Using Convolutional Neural Network
Computer Vision James Hays
Introduction to Neural Networks
Image Classification.
Counting in Dense Crowds using Deep Learning
CS 4501: Introduction to Computer Vision Training Neural Networks II
ECE 599/692 – Deep Learning Lecture 4 – CNN: Practical Issues
Very Deep Convolutional Networks for Large-Scale Image Recognition
Smart Robots, Drones, IoT
A Proposal Defense On Deep Residual Network For Face Recognition Presented By SAGAR MISHRA MECE
Deep Learning for Non-Linear Control
Visualizing CNNs and Deeper Deep Architectures
Papers 15/08.
Object Tracking: Comparison of
Logistic Regression & Transfer Learning
Deep Learning Some slides are from Prof. Andrew Ng of Stanford.
The 9 Deep Learning Papers You Need To Know About (Understanding CNNs Part 3) Fait un tour historique du domaine: quels articles/travaux ont été marquants.
ImageNet Classification with Deep Convolutional Neural Networks
Heterogeneous convolutional neural networks for visual recognition
CSCI 5922 Neural Networks and Deep Learning: Convolutional Nets For Image And Speech Processing Mike Mozer Department of Computer Science and Institute.
Course Recap and What’s Next?
Natalie Lang Tomer Malach
CS295: Modern Systems: Application Case Study Neural Network Accelerator Sang-Woo Jun Spring 2019 Many slides adapted from Hyoukjun Kwon‘s Gatech “Designing.
VERY DEEP CONVOLUTIONAL NETWORKS FOR LARGE-SCALE IMAGE RECOGNITION
Week 3 Volodymyr Bobyr.
Principles of Back-Propagation
ICLR, 2019 Jiahe Li
Presentation transcript:

The Problem: Classification Classify an image into 1000 possible classes: e.g. Abyssinian cat, Bulldog, French Terrier, Cormorant, Chickadee, red fox, banjo, barbell, hourglass, knot, maze, viaduct, etc. cat, tabby cat (0.71) Egyptian cat (0.22) red fox (0.11) …..

The Data: ILSVRC Imagenet Large Scale Visual Recognition Challenge (ILSVRC): Annual Competition 1000 Categories ~1000 training images per Category ~1 million images in total for training ~50k images for validation Only images released for the test set but no annotations, evaluation is performed centrally by the organizers (max 2 per week)

The Evaluation Metric: Top K-error Top-1 error: 1.0 Top-1 accuracy: 0.0 Top-2 error: 1.0 Top-2 accuracy: 0.0 True label: Abyssinian cat Top-3 error: 1.0 Top-3 accuracy: 0.0 Top-4 error: 0.0 Top-4 accuracy: 1.0 Top-5 error: 0.0 Top-5 accuracy: 1.0 cat, tabby cat (0.61) Egyptian cat (0.22) red fox (0.11) Abyssinian cat (0.10) French terrier (0.03) …..

Top-5 error on this competition (2012)

Alexnet https://www.saagie.com/fr/blog/object-detection-part1

What is happening? https://www.saagie.com/fr/blog/object-detection-part1

SIFT + FV + SVM (or softmax) Feature extraction (SIFT) Feature encoding (Fisher vectors) Classification (SVM or softmax) Convolutional Network (includes both feature extraction and classifier) Deep Learning

The Model: Convolutional Network

Alexnet in Keras

Preprocessing and Data Augmentation

Preprocessing and Data Augmentation 256 256

Preprocessing and Data Augmentation 224x224

Preprocessing and Data Augmentation 224x224

True label: Abyssinian cat

Other Important Aspects Using ReLUs instead of Sigmoid or Tanh Momentum + Weight Decay Dropout (Randomly sets Unit outputs to zero during training) GPU Computation!

Other Important Aspects Using ReLUs instead of Sigmoid or Tanh Momentum + Weight Decay Dropout (Randomly sets Unit outputs to zero during training) GPU Computation!

VGG Network Top-5: Keras: https://gist.github.com/baraldilorenzo/07d7802847aaad0a35d3 Simonyan and Zisserman, 2014.

GoogLeNet Keras: https://gist.github.com/joelouismarino/a2ede9ab3928f999575423b9887abd14 Szegedy et al. 2014

BatchNormalization Layer

ResNet Sorry, does not fit in slide. http://felixlaumon.github.io/assets/kaggle-right-whale/resnet.png Keras: https://github.com/raghakot/keras-resnet/blob/master/resnet.py

Slide by Mohammad Rastegari