convolutional neural networkS

Slides:



Advertisements
Similar presentations
Convolutional Neural Networks
Advertisements

Advanced topics.
Presented by: Mingyuan Zhou Duke University, ECE September 18, 2009
Deep Learning and Neural Nets Spring 2015
Natural Language Processing Lab, Tsinghua University
Hurieh Khalajzadeh Mohammad Mansouri Mohammad Teshnehlab
October 2005CSA3180 NLP1 CSA3180 Natural Language Processing Introduction and Course Overview.
ECE 6504: Deep Learning for Perception Dhruv Batra Virginia Tech Topics: –(Finish) Backprop –Convolutional Neural Nets.
Convolutional Neural Network
1 Convolutional neural networks Abin - Roozgard. 2  Introduction  Drawbacks of previous neural networks  Convolutional neural networks  LeNet 5 
Deep Learning Overview Sources: workshop-tutorial-final.pdf
Xintao Wu University of Arkansas Introduction to Deep Learning 1.
Yann LeCun Other Methods and Applications of Deep Learning Yann Le Cun The Courant Institute of Mathematical Sciences New York University
Sentiment analysis using deep learning methods
Demo.
Convolutional Neural Network
IEEE BIBM 2016 Xu Min, Wanwen Zeng, Ning Chen, Ting Chen*, Rui Jiang*
CS 388: Natural Language Processing: LSTM Recurrent Neural Networks
ECE 5424: Introduction to Machine Learning
Sentence Modeling Representation of sentences is the heart of Natural Language Processing A sentence model is a representation and analysis of semantic.
Data Mining, Neural Network and Genetic Programming
ECE 5424: Introduction to Machine Learning
Convolutional Neural Fabrics by Shreyas Saxena, Jakob Verbeek
Relation Extraction CSCI-GA.2591
Deep Learning for Natural Language Processing in R
Restricted Boltzmann Machines for Classification
References [1] - Y. LeCun, L. Bottou, Y. Bengio and P. Haffner, Gradient-Based Learning Applied to Document Recognition, Proceedings of the IEEE, 86(11): ,
Wei Wei, PhD, Zhanglong Ji, PhD, Lucila Ohno-Machado, MD, PhD
Combining CNN with RNN for scene labeling (segmentation)
Giuseppe Attardi Dipartimento di Informatica Università di Pisa
Intelligent Information System Lab
Lecture 5 Smaller Network: CNN
How it Works: Convolutional Neural Networks
CS6890 Deep Learning Weizhen Cai
Non-linear classifiers Neural networks
Introduction to Convolutional Neural Network (CNN/ConvNET)-insights from amateur George (Tian Zhou)
FUNDAMENTALS OF MACHINE LEARNING AND DEEP LEARNING
Deep Learning based Machine Translation
convolutional neural networkS
Computer Vision James Hays
Distributed Representation of Words, Sentences and Paragraphs
Convolutional Neural Networks for sentence classification
CNNs and compressive sensing Theoretical analysis
Introduction to Neural Networks
Towards Understanding the Invertibility of Convolutional Neural Networks Anna C. Gilbert1, Yi Zhang1, Kibok Lee1, Yuting Zhang1, Honglak Lee1,2 1University.
Convolutional Neural Networks
Deep learning Introduction Classes of Deep Learning Networks
Giuseppe Attardi Dipartimento di Informatica Università di Pisa
Convolutional neural networks Abin - Roozgard.
Machine Learning 101 Intro to AI, ML, Deep Learning
Seminar Topics and Projects
Creating Data Representations
On Convolutional Neural Network
SVM-based Deep Stacking Networks
Analysis of Trained CNN (Receptive Field & Weights of Network)
RCNN, Fast-RCNN, Faster-RCNN
How To Extend the Training Data
Giuseppe Attardi Dipartimento di Informatica Università di Pisa
Deep Learning Authors: Yann LeCun, Yoshua Bengio, Geoffrey Hinton
CSC 578 Neural Networks and Deep Learning
Learn to Comment Mentor: Mahdi M. Kalayeh
Artificial Intelligence 2004 Speech & Natural Language Processing
The experiments based on Recurrent Neural Networks
Multi-Modal Multi-Scale Deep Learning for Large-Scale Image Annotation
Deep learning: Recurrent Neural Networks CV192
Peng Cui Tsinghua University
Bidirectional LSTM-CRF Models for Sequence Tagging
CSC 578 Neural Networks and Deep Learning
SDSEN: Self-Refining Deep Symmetry Enhanced Network
Presentation transcript:

convolutional neural networkS NOTE: To change the image on this slide, select the picture and delete it. Then click the Pictures icon in the placeholder to insert your own image. Informationsverarbeitung II: Informationsextraktion mit Neuronalen Netzwerken Eduard Saller

Overview History CNN Topology Overview Example with step by step introduction of important terms CNNs for NLP Discussion

History Yann LeCun New York University Facebook Artificial Intelligence Research Yoshua Bengio Université de Montréal In 1995, Yann LeCun and Yoshua Bengio introduced the concept of convolutional neural networks.

CNN Topology http://www.nallatech.com/fpga-acceleration-convolutional-neural-networks/

Recap: Non-Linearity? A neural network is only non-linear if you squash the output signal from the nodes with a non-linear activation function. => arbitrary function approximator Interpreting the squashed output signal could very well be interpreted as the strength of this signal (biologically speaking)

Toy Example A two-dimensional array of pixels CNN X or O

Toy Example CNN X CNN O

Harder to solve problem CNN X translation scaling rotation weight CNN O

How to compare? ? =

Compare smaller areas = = =

Term: Filter

Term: Filter

Term: Convolution Used fitlers to create a filter layer based on the input - Yellow area: counts all ones that are part of ist diagonal - Green area: input features http://deeplearning.stanford.edu/wiki/index.php/Feature_extraction_using_convolution

Building the filter layer 1+1+1+1+1+1+1+1+1 9 =1

= = =

Recap: CNN Topology http://www.nallatech.com/fpga-acceleration-convolutional-neural-networks/

Term: Pooling Layers http://cs231n.github.io/convolutional-networks/#pool

Stacking Convolution Non-linear Pooling

Stacking Convolution Non-linear Pooling

Term: fully-connected layer takes an input volume and outputs an N dimensional vector

Toy example: fully-connected layer

fully-connected layer => Voting on an answer X O

Term: Strides http://cs231n.github.io/convolutional-networks/

Term: Strides without strides with strides

Term: Padding A Convolutional Neural Network for Modelling Sentences (2014)

Term: Padding

NLP: What area would be good fits classifications tasks, such as: Sentiment Analysis Spam Detection Topic Categorization

NLP Source: Zhang, Y., & Wallace, B. (2015). A Sensitivity Analysis of (and Practitioners’ Guide to) Convolutional Neural Networks for Sentence Classification

References [1] Kim, Y. (2014). Convolutional Neural Networks for Sentence Classification. Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP 2014), 1746–1751. [2] Kalchbrenner, N., Grefenstette, E., & Blunsom, P. (2014). A Convolutional Neural Network for Modelling Sentences. Acl, 655–665. [3] Santos, C. N. dos, & Gatti, M. (2014). Deep Convolutional Neural Networks for Sentiment Analysis of Short Texts. In COLING-2014 (pp. 69–78). [4] Johnson, R., & Zhang, T. (2015). Effective Use of Word Order for Text Categorization with Convolutional Neural Networks. To Appear: NAACL-2015, (2011). [5] Johnson, R., & Zhang, T. (2015). Semi-supervised Convolutional Neural Networks for Text Categorization via Region Embedding. [6] Wang, P., Xu, J., Xu, B., Liu, C., Zhang, H., Wang, F., & Hao, H. (2015). Semantic Clustering and Convolutional Neural Network for Short Text Categorization. Proceedings ACL 2015, 352–357. [7] Zhang, Y., & Wallace, B. (2015). A Sensitivity Analysis of (and Practitioners’ Guide to) Convolutional Neural Networks for Sentence Classification, [8] Nguyen, T. H., & Grishman, R. (2015). Relation Extraction: Perspective from Convolutional Neural Networks. Workshop on Vector Modeling for NLP, 39–48. [9] Sun, Y., Lin, L., Tang, D., Yang, N., Ji, Z., & Wang, X. (2015). Modeling Mention , Context and Entity with Neural Networks for Entity Disambiguation, (Ijcai), 1333–1339.

References [10] Zeng, D., Liu, K., Lai, S., Zhou, G., & Zhao, J. (2014). Relation Classification via Convolutional Deep Neural Network. Coling, (2011), 2335–2344.  [11] Gao, J., Pantel, P., Gamon, M., He, X., & Deng, L. (2014). Modeling Interestingness with Deep Neural Networks. [12] Shen, Y., He, X., Gao, J., Deng, L., & Mesnil, G. (2014). A Latent Semantic Model with Convolutional-Pooling Structure for Information Retrieval. Proceedings of the 23rd ACM International Conference on Conference on Information and Knowledge Management – CIKM ’14, 101–110.  [13] Weston, J., & Adams, K. (2014). # T AG S PACE : Semantic Embeddings from Hashtags, 1822–1827. [14] Brandon Roher, Toy example [15] Santos, C., & Zadrozny, B. (2014). Learning Character-level Representations for Part-of-Speech Tagging. Proceedings of the 31st International Conference on Machine Learning, ICML-14(2011), 1818–1826.  [16] Zhang, X., Zhao, J., & LeCun, Y. (2015). Character-level Convolutional Networks for Text Classification, 1–9. [17] Zhang, X., & LeCun, Y. (2015). Text Understanding from Scratch. arXiv E-Prints, 3, 011102. [18] Kim, Y., Jernite, Y., Sontag, D., & Rush, A. M. (2015). Character-Aware Neural Language Models.