Motivation It can effectively mine multi-modal knowledge with structured textural and visual relationships from web automatically. We propose BC-DNN method.

Slides:



Advertisements
Similar presentations
National Technical University of Athens Department of Electrical and Computer Engineering Image, Video and Multimedia Systems Laboratory
Advertisements

Pseudo-Relevance Feedback For Multimedia Retrieval By Rong Yan, Alexander G. and Rong Jin Mwangi S. Kariuki
Office of SA to CNS GeoIntelligence Introduction Data Mining vs Image Mining Image Mining - Issues and Challenges CBIR Image Mining Process Ontology.
Face Recognition: A Convolutional Neural Network Approach
Image Repairing: Robust Image Synthesis by Adaptive ND Tensor Voting IEEE Computer Society Conference on Computer Vision and Pattern Recognition Jiaya.
Image classification Given the bag-of-features representations of images from different classes, how do we learn a model for distinguishing them?
Machine learning continued Image source:
ACM Multimedia th Annual Conference, October , 2004
Very Low Resolution Face Recognition Problem
Support Vector Machines Pattern Recognition Sergios Theodoridis Konstantinos Koutroumbas Second Edition A Tutorial on Support Vector Machines for Pattern.
1 Visual Information Extraction in Content-based Image Retrieval System Presented by: Mian Huang Weichuan Dong Apr 29, 2004.
Presented by Zeehasham Rasheed
5/30/2006EE 148, Spring Visual Categorization with Bags of Keypoints Gabriella Csurka Christopher R. Dance Lixin Fan Jutta Willamowski Cedric Bray.
ICME 2004 Tzvetanka I. Ianeva Arjen P. de Vries Thijs Westerveld A Dynamic Probabilistic Multimedia Retrieval Model.
R-CNN By Zhang Liliang.
K-means Based Unsupervised Feature Learning for Image Recognition Ling Zheng.
© 2013 IBM Corporation Efficient Multi-stage Image Classification for Mobile Sensing in Urban Environments Presented by Shashank Mujumdar IBM Research,
Self-organizing GIS for solving problems of ecology and landscape studying Nikolay G. Markov, Alexandr A. Napryushkin Tomsk Polytechnical University, GIS.
Bridge Semantic Gap: A Large Scale Concept Ontology for Multimedia (LSCOM) Guo-Jun Qi Beckman Institute University of Illinois at Urbana-Champaign.
AUTOMATIC ANNOTATION OF GEO-INFORMATION IN PANORAMIC STREET VIEW BY IMAGE RETRIEVAL Ming Chen, Yueting Zhuang, Fei Wu College of Computer Science, Zhejiang.
Exploiting Ontologies for Automatic Image Annotation M. Srikanth, J. Varner, M. Bowden, D. Moldovan Language Computer Corporation
Marcin Marszałek, Ivan Laptev, Cordelia Schmid Computer Vision and Pattern Recognition, CVPR Actions in Context.
Digital Image Processing & Analysis Spring Definitions Image Processing Image Analysis (Image Understanding) Computer Vision Low Level Processes:
Svetlana Lazebnik, Cordelia Schmid, Jean Ponce
Kernel Methods A B M Shawkat Ali 1 2 Data Mining ¤ DM or KDD (Knowledge Discovery in Databases) Extracting previously unknown, valid, and actionable.
Learning Decompositional Shape Models from Examples Alex Levinshtein Cristian Sminchisescu Sven Dickinson Sven Dickinson University of Toronto.
Stylization and Abstraction of Photographs Doug Decarlo and Anthony Santella.
Constructing Knowledge Graph from Unstructured Text Image Source: Kundan Kumar Siddhant Manocha.
Beyond Sliding Windows: Object Localization by Efficient Subwindow Search The best paper prize at CVPR 2008.
Efficient Subwindow Search: A Branch and Bound Framework for Object Localization ‘PAMI09 Beyond Sliding Windows: Object Localization by Efficient Subwindow.
 To Cover the basic theory and algorithms that are widely used in digital image processing.  To Expose students to current technologies and issues that.
Unsupervised Learning of Visual Sense Models for Polysemous Words Kate Saenko Trevor Darrell Deepak.
Segmentation of 3D Tubular Structures Paul Hernandez-Herrera Computational Biomedicine Lab Advisor: Ioannis A. Kakadiaris and Manos Papadakis 1.
Categorization by Learning and Combing Object Parts B. Heisele, T. Serre, M. Pontil, T. Vetter, T. Poggio. Presented by Manish Jethwa.
Image Classification for Automatic Annotation
1/12/ Multimedia Data Mining. Multimedia data types any type of information medium that can be represented, processed, stored and transmitted over.
Speaker Change Detection using Support Vector Machines V.Kartik, D.Srikrishna Satish and C.Chandra Sekhar Speech and Vision Laboratory Department of Computer.
Quiz Week 8 Topical. Topical Quiz (Section 2) What is the difference between Computer Vision and Computer Graphics What is the difference between Computer.
Machine Learning in CSC 196K
Musical Genre Categorization Using Support Vector Machines Shu Wang.
Regression analyses Model constructs per brand & modality Likeability, Affect, Brand knowledge, Brand attitude, Recognition 1.
Geographical Data Mining Thales Sehn Korting
BIS 219 Week 2 DQ 1 What is the relationship amongst data, information, and knowledge? What are the advantages of a knowledge management system? How may.
Inductive model evolved from data
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
Given Slope & y-Intercept
Deep Compositional Cross-modal Learning to Rank via Local-Global Alignment Xinyang Jiang, Fei Wu, Xi Li, Zhou Zhao, Weiming Lu, Siliang Tang, Yueting.
Geographical Data Mining
Semantic Video Classification
Color-Texture Analysis for Content-Based Image Retrieval
Self-Organizing Maps for Content-Based Image Database Retrieval
Using Transductive SVMs for Object Classification in Images
Speaker: Lingxi Xie Authors: Lingxi Xie, Qi Tian, Bo Zhang
Outline Announcement Texture modeling - continued Some remarks
Lecture 22 Clustering (3).
ECE 692 – Advanced Topics in Computer Vision
Types of operations The types of operations that can be applied to digital images to transform an input image a[m,n] into an output image b[m,n] (or another.
?. ? White Fuzzy Color Oblong Texture Shape Most problems in vision are underconstrained White Color Most problems in vision are underconstrained.
Aline Martin ECE738 Project – Spring 2005
CSE 635 Multimedia Information Retrieval
Papers 15/08.
Multimodal Caricatural Mirror
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
Word2Vec.
Face Recognition: A Convolutional Neural Network Approach
Support vector machine-based text detection in digital video
Human-object interaction
Learning to Detect Human-Object Interactions with Knowledge
Introduction Face detection and alignment are essential to many applications such as face recognition, facial expression recognition, age identification,
Presentation transcript:

Motivation It can effectively mine multi-modal knowledge with structured textural and visual relationships from web automatically. We propose BC-DNN method to project different modalities into a common knowledge vector space for a united knowledge representation. We construct a large-scale muti-modal relationship library

Motivation

Framework

Bi-enhanced cross-modal knowledge representation

Visual Relationship Recognition the input of this experiment is the image region containing visual relationship and the output is its relationship type extract all knowledge vectors from these relationship regions and use multi-class SVM to train the visual relationship recognition model

Zero-shot Multi-modal Retrieval