Toward improved document classification and retrieval

Slides:



Advertisements
Similar presentations
Classification spotlights
Advertisements

ImageNet Classification with Deep Convolutional Neural Networks
Deep Learning and Neural Nets Spring 2015
Spatial Pyramid Pooling in Deep Convolutional
Deep Learning for Big Data P. Baldi University of California, Irvine Department of Computer Science Institute for Genomics and Bioinformatics Center for.
Kuan-Chuan Peng Tsuhan Chen
Richard Socher Cliff Chiung-Yu Lin Andrew Y. Ng Christopher D. Manning
Detection, Segmentation and Fine-grained Localization
Unsupervised Visual Representation Learning by Context Prediction
Philipp Gysel ECE Department University of California, Davis
Parsing Natural Scenes and Natural Language with Recursive Neural Networks INTERNATIONAL CONFERENCE ON MACHINE LEARNING (ICML 2011) RICHARD SOCHER CLIFF.
NTNU Speech and Machine Intelligence Laboratory 1 Autoregressive product of multi-frame predictions can improve the accuracy of hybrid models 2016/05/31.
Deep Learning for Big Data
Recent developments in object detection
Faster R-CNN – Concepts
Inter-experimental LHC Machine Learning Working Group Activities
Machine Learning for Big Data
Object Detection based on Segment Masks
ISBI Camelyon16 Challenge Prague, April 13, 2016
New Machine Learning in Medical Imaging Journal Club
Prepared by: Mahmoud Rafeek Al-Farra
Convolutional Neural Fabrics by Shreyas Saxena, Jakob Verbeek
Krishna Kumar Singh, Yong Jae Lee University of California, Davis
Saliency-guided Video Classification via Adaptively weighted learning
Wei Wei, PhD, Zhanglong Ji, PhD, Lucila Ohno-Machado, MD, PhD
Combining CNN with RNN for scene labeling (segmentation)
Part-Based Room Categorization for Household Service Robots
Ajita Rattani and Reza Derakhshani,
Hierarchical Deep Convolutional Neural Network
Natural Language Processing of Knee MRI Reports
Classification of Hand-Written Digits Using Scattering Convolutional Network Dongmian Zou Advisor: Professor Radu Balan.
Summary Presentation.
ECE 6504 Deep Learning for Perception
CS6890 Deep Learning Weizhen Cai
Machine Learning: The Connectionist
Adri`a Recasens, Aditya Khosla, Carl Vondrick, Antonio Torralba
Bird-species Recognition Using Convolutional Neural Network
Quanzeng You, Jiebo Luo, Hailin Jin and Jianchao Yang
Introduction to Neural Networks
Figure 4. Testing minimal configurations with existing models for spatiotemporal recognition. (A-B) A binary classifier is trained to separate a positive.
Jia-Bin Huang Virginia Tech ECE 6554 Advanced Computer Vision
Alain Goossens & Jean-Pierre Van Loo Data scientists – SII Belgium
Object Detection + Deep Learning
Very Deep Convolutional Networks for Large-Scale Image Recognition
8-3 RRAM Based Convolutional Neural Networks for High Accuracy Pattern Recognition and Online Learning Tasks Z. Dong, Z. Zhou, Z.F. Li, C. Liu, Y.N. Jiang,
Machine Learning 101 Intro to AI, ML, Deep Learning
Creating Data Representations
Lecture: Deep Convolutional Neural Networks
Outline Background Motivation Proposed Model Experimental Results
Recurrent Encoder-Decoder Networks for Time-Varying Dense Predictions
John H.L. Hansen & Taufiq Al Babba Hasan
Neural Network Pipeline CONTACT & ACKNOWLEDGEMENTS
Ladislav Rampasek, Anna Goldenberg  Cell 
Heterogeneous convolutional neural networks for visual recognition
Abnormally Detection
Deep Learning Authors: Yann LeCun, Yoshua Bengio, Geoffrey Hinton
Reuben Feinman Research advised by Brenden Lake
Department of Computer Science Ben-Gurion University of the Negev
Automating stroke lesion segmentation in brain images using a multi-model multi-path convolutional neural network Yunzhe.
Natalie Lang Tomer Malach
CS295: Modern Systems: Application Case Study Neural Network Accelerator Sang-Woo Jun Spring 2019 Many slides adapted from Hyoukjun Kwon‘s Gatech “Designing.
VERY DEEP CONVOLUTIONAL NETWORKS FOR LARGE-SCALE IMAGE RECOGNITION
Bug Localization with Combination of Deep Learning and Information Retrieval A. N. Lam et al. International Conference on Program Comprehension 2017.
Semantic Segmentation
Object Detection Implementations
Trusting Machine Learning Algorithms for Safeguards Applications
End-to-End Facial Alignment and Recognition
Report 2 Brandon Silva.
Prabhas Chongstitvatana Chulalongkorn University
Tschandl P1,2, Argenziano G3, Razmara M4, Yap J4
Presentation transcript:

Toward improved document classification and retrieval Richard Muñoz EECS 6898 May 5, 2016

Document Classification/Retrieval Images from LexisNexis and kCura

CNNs to the Rescue? Images from: Krizhevsky et al. (2012). Imagenet classification with deep convolutional neural networks. In Advances in neural information processing systems. Harley et al. (2015, August). Evaluation of deep convolutional nets for document image classification and retrieval. In ICDAR.

Can we do better? Build Decision Trees that consider document structure Crop out region and send to CNN denoted by leaf Predict classification or relevance score Inspired by context-dependent selection of GMMs (and later NNs) in speech recognition Difficult layout segmentation or unknown layouts: Back off to single CNN model

Data Sources Tobacco litigation files Medical journal articles NIST tax forms Patent figures Potentially: FOIA requests Collaborations Images from: - Csurka et al. (2016). What is the right way to represent document images?. arXiv preprint arXiv:1603.01076.

Evaluation Classification Accuracy Mean average precision Performance on poorly OCR’d documents given higher weight Scalability with respect to Labeling datasets Computational time for CNNs

Thank you