Visual Question Generation

Slides:



Advertisements
Similar presentations
Deep Learning and Neural Nets Spring 2015
Advertisements

Deep Residual Learning for Image Recognition
NOTE: To change the image on this slide, select the picture and delete it. Then click the Pictures icon in the placeholder to insert your own image. SHOW.
Lecture 3a Analysis of training of NN
A Hierarchical Deep Temporal Model for Group Activity Recognition
A Sentence Interaction Network for Modeling Dependence between Sentences Biao Liu, Minlie Huang Tsinghua University.
What Convnets Make for Image Captioning?
CS 388: Natural Language Processing: LSTM Recurrent Neural Networks
CS 4501: Introduction to Computer Vision Computer Vision + Natural Language Connelly Barnes Some slides from Fei-Fei Li / Andrej Karpathy / Justin Johnson.
Machine Learning for Big Data
Data Mining, Neural Network and Genetic Programming
The Problem: Classification
Show and Tell: A Neural Image Caption Generator (CVPR 2015)
Combining CNN with RNN for scene labeling (segmentation)
Ajita Rattani and Reza Derakhshani,
Hierarchical Deep Convolutional Neural Network
Intelligent Information System Lab
Synthesis of X-ray Projections via Deep Learning
mengye ren, ryan kiros, richard s. zemel
Machine Learning: The Connectionist
R-CNN region By Ilia Iofedov 11/11/2018 BGU, DNN course 2016.
ECE 599/692 – Deep Learning Lecture 6 – CNN: The Variants
Layer-wise Performance Bottleneck Analysis of Deep Neural Networks
Attention-based Caption Description Mun Jonghwan.
Introduction to Neural Networks
Neural network systems
Deep Learning Tutorial
Home Automation Enhancement with EEG Based Headset (Orpheus)
Final Presentation: Neural Network Doc Summarization
SAS Deep Learning: From Toolkit to Fast Model Prototyping
Deep Visual-Semantic Alignments for Generating Image Descriptions
The Big Health Data–Intelligent Machine Paradox
Long Short Term Memory within Recurrent Neural Networks
Visualizing CNNs and Deeper Deep Architectures
Papers 15/08.
Machine Learning – Neural Networks David Fenyő
Recurrent Encoder-Decoder Networks for Time-Varying Dense Predictions
LECTURE 42: AUTOMATIC INTERPRETATION OF EEGS
LECTURE 41: AUTOMATIC INTERPRETATION OF EEGS
Logistic Regression & Transfer Learning
The 9 Deep Learning Papers You Need To Know About (Understanding CNNs Part 3) Fait un tour historique du domaine: quels articles/travaux ont été marquants.
Presentation By: Eryk Helenowski PURE Mentor: Vincent Bindschaedler
Heterogeneous convolutional neural networks for visual recognition
Course Recap and What’s Next?
Learn to Comment Mentor: Mahdi M. Kalayeh
ACID: Analyzing Census and Imaging Data
Lecture 21: Machine Learning Overview AP Computer Science Principles
Recurrent Neural Networks (RNNs)
Sequence to Sequence Video to Text
Automatic Handwriting Generation
Presented by: Anurag Paul
Natalie Lang Tomer Malach
CS295: Modern Systems: Application Case Study Neural Network Accelerator Sang-Woo Jun Spring 2019 Many slides adapted from Hyoukjun Kwon‘s Gatech “Designing.
VERY DEEP CONVOLUTIONAL NETWORKS FOR LARGE-SCALE IMAGE RECOGNITION
Neural Machine Translation using CNN
Deep screen image crop and enhance
Deep screen image crop and enhance
Deep learning: Recurrent Neural Networks CV192
CRCV REU 2019 Kara Schatz.
Week 3 Volodymyr Bobyr.
End-to-End Speech-Driven Facial Animation with Temporal GANs
Real-time Object Recognition using deep learning-Raspberry Pi
Deep screen image crop and enhance
Prabhas Chongstitvatana Chulalongkorn University
First Name Last Name, First Name Last Name, First Name Last Name
CRCV REU 2019 Aaron Honculada.
Deep screen image crop and enhance
Lecture 9: Machine Learning Overview AP Computer Science Principles
ICLR, 2019 Jiahe Li
CVPR2019 Jiahe Li SiamRPN introduces the region proposal network after the Siamese network and performs joint classification and regression.
Presentation transcript:

Visual Question Generation Jhih-Ciang Wu Institution of Information Science, Academia Sinica jcwu@iis.sinica.edu.tw May. 8, 2018

Overview Backgrounds Baseline model References ILSVRC VGG RNN LSTM CNN+RNN References

ILSVRC ImageNet Large Scale Visual Recognition Challenge. In classfication task, we list winners over the years. AlexNet(2012) ZFNet(2013) VGGNet(2014 The second place) ResNet(2015) MaskRCNN(2017)

VGG VGG uses very small 3×3 filters in all convolutional layers.

VGG

RNN Recurrent Neural Network(RNN): allows it to exhibit dynamic temporal behavior.

LSTM Long Short-Term Memory(LSTM): a special kind of RNN, capable of learning long-term dependencies.

LSTM

LSTM

LSTM

LSTM

Baseline model

CNN+LSTM what color is the surfboard ? ∗learning rate = 0.00001, batch = 64, epochs = 100.

CNN+LSTM is this a zebra ? ∗learning rate = 0.00001, batch = 64, epochs = 100.

CNN+LSTM what color are the flowers ? ∗learning rate = 0.00001, batch = 64, epochs = 100.

CNN+LSTM what is the green vegetable ? ∗learning rate = 0.00001, batch = 64, epochs = 100.

CNN+LSTM how many people are in the picture ? ∗learning rate = 0.00001, batch = 64, epochs = 100.

Modified MLP We use K-means method to separate training data into K clusters.

Reference Deep Visual-Semantic Alignments for Generating Image Descriptions Show and Tell: A Neural Image Caption Generator