convolutional neural networkS

Slides:



Advertisements
Similar presentations
Convolutional Neural Networks
Advertisements

Advanced topics.
Presented by: Mingyuan Zhou Duke University, ECE September 18, 2009
Natural Language Processing Lab, Tsinghua University
Hurieh Khalajzadeh Mohammad Mansouri Mohammad Teshnehlab
ECE 6504: Deep Learning for Perception Dhruv Batra Virginia Tech Topics: –(Finish) Backprop –Convolutional Neural Nets.
Convolutional Neural Network
1 Convolutional neural networks Abin - Roozgard. 2  Introduction  Drawbacks of previous neural networks  Convolutional neural networks  LeNet 5 
Deep Learning Overview Sources: workshop-tutorial-final.pdf
Xintao Wu University of Arkansas Introduction to Deep Learning 1.
DeepWalk: Online Learning of Social Representations
Survey on state-of-the-art approaches: Neural Network Trends in Speech Recognition Survey on state-of-the-art approaches: Neural Network Trends in Speech.
Yann LeCun Other Methods and Applications of Deep Learning Yann Le Cun The Courant Institute of Mathematical Sciences New York University
A Sentence Interaction Network for Modeling Dependence between Sentences Biao Liu, Minlie Huang Tsinghua University.
Sentiment analysis using deep learning methods
Demo.
Convolutional Neural Network
IEEE BIBM 2016 Xu Min, Wanwen Zeng, Ning Chen, Ting Chen*, Rui Jiang*
CS 388: Natural Language Processing: LSTM Recurrent Neural Networks
Deep Learning for Bacteria Event Identification
Deep Learning Amin Sobhani.
ECE 5424: Introduction to Machine Learning
Sentence Modeling Representation of sentences is the heart of Natural Language Processing A sentence model is a representation and analysis of semantic.
Data Mining, Neural Network and Genetic Programming
ECE 5424: Introduction to Machine Learning
Convolutional Neural Fabrics by Shreyas Saxena, Jakob Verbeek
Relation Extraction CSCI-GA.2591
Deep Learning for Natural Language Processing in R
Restricted Boltzmann Machines for Classification
References [1] - Y. LeCun, L. Bottou, Y. Bengio and P. Haffner, Gradient-Based Learning Applied to Document Recognition, Proceedings of the IEEE, 86(11): ,
Wei Wei, PhD, Zhanglong Ji, PhD, Lucila Ohno-Machado, MD, PhD
Combining CNN with RNN for scene labeling (segmentation)
Giuseppe Attardi Dipartimento di Informatica Università di Pisa
Intelligent Information System Lab
Deep learning and applications to Natural language processing
Lecture 5 Smaller Network: CNN
How it Works: Convolutional Neural Networks
CS6890 Deep Learning Weizhen Cai
Non-linear classifiers Neural networks
Introduction to Convolutional Neural Network (CNN/ConvNET)-insights from amateur George (Tian Zhou)
FUNDAMENTALS OF MACHINE LEARNING AND DEEP LEARNING
Deep Learning based Machine Translation
Computer Vision James Hays
Distributed Representation of Words, Sentences and Paragraphs
Convolutional Neural Networks for sentence classification
CNNs and compressive sensing Theoretical analysis
Introduction to Neural Networks
Towards Understanding the Invertibility of Convolutional Neural Networks Anna C. Gilbert1, Yi Zhang1, Kibok Lee1, Yuting Zhang1, Honglak Lee1,2 1University.
convolutional neural networkS
Convolutional Neural Networks
Deep learning Introduction Classes of Deep Learning Networks
Giuseppe Attardi Dipartimento di Informatica Università di Pisa
Convolutional neural networks Abin - Roozgard.
Machine Learning 101 Intro to AI, ML, Deep Learning
Seminar Topics and Projects
Creating Data Representations
Introduction to Natural Language Processing
SVM-based Deep Stacking Networks
Problems with CNNs and recent innovations 2/13/19
Presentation By: Eryk Helenowski PURE Mentor: Vincent Bindschaedler
Giuseppe Attardi Dipartimento di Informatica Università di Pisa
Deep Learning Authors: Yann LeCun, Yoshua Bengio, Geoffrey Hinton
CSC 578 Neural Networks and Deep Learning
Learn to Comment Mentor: Mahdi M. Kalayeh
Artificial Intelligence 2004 Speech & Natural Language Processing
The experiments based on Recurrent Neural Networks
Deep learning: Recurrent Neural Networks CV192
Peng Cui Tsinghua University
Bidirectional LSTM-CRF Models for Sequence Tagging
Presentation transcript:

convolutional neural networkS NOTE: To change the image on this slide, select the picture and delete it. Then click the Pictures icon in the placeholder to insert your own image. Informationsverarbeitung II: Informationsextraktion mit Neuronalen Netzwerken Eduard Saller

Overview History CNN Topology Overview Example with step by step introduction of important terms CNNs for NLP Discussion

History Yann LeCun New York University Facebook Artificial Intelligence Research Yoshua Bengio Université de Montréal In 1995, Yann LeCun and Yoshua Bengio introduced the concept of convolutional neural networks.

CNN Topology http://www.nallatech.com/fpga-acceleration-convolutional-neural-networks/

Recap: Non-Linearity? A neural network is only non-linear if you squash the output signal from the nodes with a non-linear activation function. => arbitrary function approximator Interpreting the squashed output signal could very well be interpreted as the strength of this signal (biologically speaking)

Toy Example A two-dimensional array of pixels CNN X or O

Toy Example CNN X CNN O

Harder to solve problem CNN X translation scaling rotation weight CNN O

How to compare? ? =

Compare smaller areas = = =

Term: Filter

Term: Filter

Term: Convolution Used fitlers to create a filter layer based on the input - Yellow area: counts all ones that are part of ist diagonal - Green area: input features http://deeplearning.stanford.edu/wiki/index.php/Feature_extraction_using_convolution

Building the filter layer 1+1+1+1+1+1+1+1+1 9 =1

= = =

Recap: CNN Topology http://www.nallatech.com/fpga-acceleration-convolutional-neural-networks/

Term: Pooling Layers http://cs231n.github.io/convolutional-networks/#pool

Stacking Convolution Non-linear Pooling

Stacking Convolution Non-linear Pooling

Term: fully-connected layer takes an input volume and outputs an N dimensional vector

Toy example: fully-connected layer

fully-connected layer => Voting on an answer X O

Term: Strides http://cs231n.github.io/convolutional-networks/

Term: Strides without strides with strides

Term: Padding A Convolutional Neural Network for Modelling Sentences (2014)

Term: Padding

NLP: What area would be good fits classifications tasks, such as: Sentiment Analysis Spam Detection Topic Categorization

NLP: What area would be bad fits Convolutions and pooling operations lose information about the local order of words PoS Tagging Entity Extraction => sequence tagging is a harder to fit into a pure CNN architecture

NLP Source: Zhang, Y., & Wallace, B. (2015). A Sensitivity Analysis of (and Practitioners’ Guide to) Convolutional Neural Networks for Sentence Classification

References [1] Kim, Y. (2014). Convolutional Neural Networks for Sentence Classification. Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP 2014), 1746–1751. [2] Kalchbrenner, N., Grefenstette, E., & Blunsom, P. (2014). A Convolutional Neural Network for Modelling Sentences. Acl, 655–665. [3] Santos, C. N. dos, & Gatti, M. (2014). Deep Convolutional Neural Networks for Sentiment Analysis of Short Texts. In COLING-2014 (pp. 69–78). [4] Johnson, R., & Zhang, T. (2015). Effective Use of Word Order for Text Categorization with Convolutional Neural Networks. To Appear: NAACL-2015, (2011). [5] Johnson, R., & Zhang, T. (2015). Semi-supervised Convolutional Neural Networks for Text Categorization via Region Embedding. [6] Wang, P., Xu, J., Xu, B., Liu, C., Zhang, H., Wang, F., & Hao, H. (2015). Semantic Clustering and Convolutional Neural Network for Short Text Categorization. Proceedings ACL 2015, 352–357. [7] Zhang, Y., & Wallace, B. (2015). A Sensitivity Analysis of (and Practitioners’ Guide to) Convolutional Neural Networks for Sentence Classification, [8] Nguyen, T. H., & Grishman, R. (2015). Relation Extraction: Perspective from Convolutional Neural Networks. Workshop on Vector Modeling for NLP, 39–48. [9] Sun, Y., Lin, L., Tang, D., Yang, N., Ji, Z., & Wang, X. (2015). Modeling Mention , Context and Entity with Neural Networks for Entity Disambiguation, (Ijcai), 1333–1339.

References [10] Zeng, D., Liu, K., Lai, S., Zhou, G., & Zhao, J. (2014). Relation Classification via Convolutional Deep Neural Network. Coling, (2011), 2335–2344.  [11] Gao, J., Pantel, P., Gamon, M., He, X., & Deng, L. (2014). Modeling Interestingness with Deep Neural Networks. [12] Shen, Y., He, X., Gao, J., Deng, L., & Mesnil, G. (2014). A Latent Semantic Model with Convolutional-Pooling Structure for Information Retrieval. Proceedings of the 23rd ACM International Conference on Conference on Information and Knowledge Management – CIKM ’14, 101–110.  [13] Weston, J., & Adams, K. (2014). # T AG S PACE : Semantic Embeddings from Hashtags, 1822–1827. [14] Brandon Roher, Toy example [15] Santos, C., & Zadrozny, B. (2014). Learning Character-level Representations for Part-of-Speech Tagging. Proceedings of the 31st International Conference on Machine Learning, ICML-14(2011), 1818–1826.  [16] Zhang, X., Zhao, J., & LeCun, Y. (2015). Character-level Convolutional Networks for Text Classification, 1–9. [17] Zhang, X., & LeCun, Y. (2015). Text Understanding from Scratch. arXiv E-Prints, 3, 011102. [18] Kim, Y., Jernite, Y., Sontag, D., & Rush, A. M. (2015). Character-Aware Neural Language Models.