Visual Question Generation

Slides:

Advertisements

Similar presentations

Deep Learning and Neural Nets Spring 2015

Advertisements

Deep Residual Learning for Image Recognition

NOTE: To change the image on this slide, select the picture and delete it. Then click the Pictures icon in the placeholder to insert your own image. SHOW.

Lecture 3a Analysis of training of NN

A Hierarchical Deep Temporal Model for Group Activity Recognition

A Sentence Interaction Network for Modeling Dependence between Sentences Biao Liu, Minlie Huang Tsinghua University.

What Convnets Make for Image Captioning?

CS 388: Natural Language Processing: LSTM Recurrent Neural Networks

CS 4501: Introduction to Computer Vision Computer Vision + Natural Language Connelly Barnes Some slides from Fei-Fei Li / Andrej Karpathy / Justin Johnson.

Machine Learning for Big Data

Data Mining, Neural Network and Genetic Programming

The Problem: Classification

Show and Tell: A Neural Image Caption Generator (CVPR 2015)

Combining CNN with RNN for scene labeling (segmentation)

Ajita Rattani and Reza Derakhshani,

Hierarchical Deep Convolutional Neural Network

Intelligent Information System Lab

Synthesis of X-ray Projections via Deep Learning

mengye ren, ryan kiros, richard s. zemel

Machine Learning: The Connectionist

R-CNN region By Ilia Iofedov 11/11/2018 BGU, DNN course 2016.

ECE 599/692 – Deep Learning Lecture 6 – CNN: The Variants

Layer-wise Performance Bottleneck Analysis of Deep Neural Networks

Attention-based Caption Description Mun Jonghwan.

Introduction to Neural Networks

Neural network systems

Deep Learning Tutorial

Home Automation Enhancement with EEG Based Headset (Orpheus)

Final Presentation: Neural Network Doc Summarization

SAS Deep Learning: From Toolkit to Fast Model Prototyping

Deep Visual-Semantic Alignments for Generating Image Descriptions

The Big Health Data–Intelligent Machine Paradox

Long Short Term Memory within Recurrent Neural Networks

Visualizing CNNs and Deeper Deep Architectures

Machine Learning – Neural Networks David Fenyő

Recurrent Encoder-Decoder Networks for Time-Varying Dense Predictions

LECTURE 42: AUTOMATIC INTERPRETATION OF EEGS

LECTURE 41: AUTOMATIC INTERPRETATION OF EEGS

Logistic Regression & Transfer Learning

The 9 Deep Learning Papers You Need To Know About (Understanding CNNs Part 3) Fait un tour historique du domaine: quels articles/travaux ont été marquants.

Presentation By: Eryk Helenowski PURE Mentor: Vincent Bindschaedler

Heterogeneous convolutional neural networks for visual recognition

Course Recap and What’s Next?

Learn to Comment Mentor: Mahdi M. Kalayeh

ACID: Analyzing Census and Imaging Data

Lecture 21: Machine Learning Overview AP Computer Science Principles

Recurrent Neural Networks (RNNs)

Sequence to Sequence Video to Text

Automatic Handwriting Generation

Presented by: Anurag Paul

Natalie Lang Tomer Malach

CS295: Modern Systems: Application Case Study Neural Network Accelerator Sang-Woo Jun Spring 2019 Many slides adapted from Hyoukjun Kwon‘s Gatech “Designing.

VERY DEEP CONVOLUTIONAL NETWORKS FOR LARGE-SCALE IMAGE RECOGNITION

Neural Machine Translation using CNN

Deep screen image crop and enhance

Deep screen image crop and enhance

Deep learning: Recurrent Neural Networks CV192

CRCV REU 2019 Kara Schatz.

Week 3 Volodymyr Bobyr.

End-to-End Speech-Driven Facial Animation with Temporal GANs

Real-time Object Recognition using deep learning-Raspberry Pi

Deep screen image crop and enhance

Prabhas Chongstitvatana Chulalongkorn University

First Name Last Name, First Name Last Name, First Name Last Name

CRCV REU 2019 Aaron Honculada.

Deep screen image crop and enhance

Lecture 9: Machine Learning Overview AP Computer Science Principles

ICLR, 2019 Jiahe Li

CVPR2019 Jiahe Li SiamRPN introduces the region proposal network after the Siamese network and performs joint classification and regression.

Presentation transcript:

Visual Question Generation Jhih-Ciang Wu Institution of Information Science, Academia Sinica jcwu@iis.sinica.edu.tw May. 8, 2018

Overview Backgrounds Baseline model References ILSVRC VGG RNN LSTM CNN+RNN References

ILSVRC ImageNet Large Scale Visual Recognition Challenge. In classfication task, we list winners over the years. AlexNet(2012) ZFNet(2013) VGGNet(2014 The second place) ResNet(2015) MaskRCNN(2017)

VGG VGG uses very small 3×3 filters in all convolutional layers.

VGG

RNN Recurrent Neural Network(RNN): allows it to exhibit dynamic temporal behavior.

LSTM Long Short-Term Memory(LSTM): a special kind of RNN, capable of learning long-term dependencies.

LSTM

LSTM

LSTM

LSTM

Baseline model

CNN+LSTM what color is the surfboard ? ∗learning rate = 0.00001, batch = 64, epochs = 100.

CNN+LSTM is this a zebra ? ∗learning rate = 0.00001, batch = 64, epochs = 100.

CNN+LSTM what color are the flowers ? ∗learning rate = 0.00001, batch = 64, epochs = 100.

CNN+LSTM what is the green vegetable ? ∗learning rate = 0.00001, batch = 64, epochs = 100.

CNN+LSTM how many people are in the picture ? ∗learning rate = 0.00001, batch = 64, epochs = 100.

Modified MLP We use K-means method to separate training data into K clusters.

Reference Deep Visual-Semantic Alignments for Generating Image Descriptions Show and Tell: A Neural Image Caption Generator