People-LDA using Face Recognition Lijie Heng 12/11/2007
Outline Background Latent Dirichlet allocation People-LDA Experiments Conclusion
Background Modeling text corpora: Latent Dirichlet allocation (LDA). Newspaper articles include both captions and images: captions -> LDA -> topics; images -> face recognition -> people. Could we build a joint model over both the image and the text information?
Latent Dirichlet allocation In a text corpus, assume: a word is drawn from a vocabulary {1, 2, …, V}; a document is a sequence of N words; a corpus is a collection of M documents.
Latent Dirichlet allocation-cont. To generate a document, we assume each document is generated from K topics, and each topic is a distribution over the words of the vocabulary. 1. Choose N ~ Poisson(ξ). 2. Choose θ ~ Dir(α). 3. For each of the N words w_n: (a) Choose a topic z_n ~ Multinomial(θ). (b) Choose a word w_n from p(w_n | z_n; β), a multinomial probability conditioned on the topic z_n.
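A minimal sketch of this generative process in Python/numpy; the sizes K and V, the Dirichlet parameters, and the Poisson mean ξ are illustrative assumptions, not values from the slides.

    import numpy as np

    rng = np.random.default_rng(0)

    # Illustrative sizes (assumptions): K topics, vocabulary of V words.
    K, V = 4, 1000
    alpha = np.full(K, 0.1)                          # Dirichlet prior over topic proportions
    beta = rng.dirichlet(np.full(V, 0.01), size=K)   # per-topic word distributions (K x V)
    xi = 50                                          # Poisson mean for document length

    def generate_document():
        N = rng.poisson(xi)                  # 1. choose document length N ~ Poisson(xi)
        theta = rng.dirichlet(alpha)         # 2. choose theta ~ Dir(alpha)
        words = []
        for _ in range(N):
            z = rng.choice(K, p=theta)       # 3a. choose topic z_n ~ Multinomial(theta)
            w = rng.choice(V, p=beta[z])     # 3b. choose word w_n ~ p(w | z; beta)
            words.append(w)
        return words

    doc = generate_document()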
Latent Dirichlet allocation-cont.
Latent Dirichlet allocation-cont.
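For reference, the generative process above defines the standard LDA joint distribution over θ, the topic assignments z, and the words w:

p(θ, z_{1:N}, w_{1:N} | α, β) = p(θ | α) ∏_{n=1}^{N} p(z_n | θ) p(w_n | z_n; β)

and the marginal likelihood of a document is obtained by summing over z and integrating over θ:

p(w_{1:N} | α, β) = ∫ p(θ | α) ∏_{n=1}^{N} Σ_{z_n} p(z_n | θ) p(w_n | z_n; β) dθ.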
People-LDA Take the image information in the documents into account. Anchor each topic to a single person, e.g., politics -> George Bush, sports -> Yao Ming.
People-LDA cont. Assumptions: 1. D documents in the corpus. 2. K topics/people in the corpus. 3. Each document includes an image I and a caption W. 4. Image I contains M faces, and each face consists of H patches. (A sketch of the corresponding generative process follows the next slides.)
People-LDA cont.
People-LDA cont.
People-LDA cont.
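A minimal, hypothetical sketch of the joint image+text idea, under simplified assumptions of my own (a Gaussian patch-appearance model per person and illustrative sizes); it conveys how topics are anchored to people, not the paper's exact formulation.

    import numpy as np

    rng = np.random.default_rng(1)

    # Illustrative sizes (assumptions): K people, V words,
    # H patches per face, F-dimensional patch features.
    K, V, H, F = 25, 5000, 9, 20

    alpha = np.full(K, 0.1)
    beta = rng.dirichlet(np.full(V, 0.01), size=K)   # per-person word distributions
    mu = rng.normal(size=(K, H, F))                  # per-person patch appearance means
                                                     # (anchored by the reference faces)
    sigma = 0.5                                      # shared appearance noise (assumption)

    def generate_document(n_words=30, n_faces=1):
        theta = rng.dirichlet(alpha)                 # per-document distribution over people
        # each caption word is drawn from the word distribution of a sampled person
        words = [rng.choice(V, p=beta[rng.choice(K, p=theta)]) for _ in range(n_words)]
        faces = []
        for _ in range(n_faces):
            k = rng.choice(K, p=theta)               # person depicted by this face
            patches = rng.normal(mu[k], sigma)       # H x F patch features for that person
            faces.append(patches)
        return words, faces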
Experiments Setup: 1. 10,000 documents from the "Faces in the Wild" dataset; 2. randomly select 25 names from the 1,077 distinct names appearing in the 10,000 documents; 3. obtain 25 reference faces (one image per person) as reference images; 4. perform image clustering.
Experiments Image alone: use a face identifier to cluster each face image to one of the reference images. Text alone: first cluster the caption text using LDA; then, for each caption, assign the face images to their most likely names under the multinomial distribution over topics. People-LDA: the joint model described above.
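A rough sketch of the "text alone" assignment step described above, assuming hypothetical inputs: theta holds the per-caption topic mixtures from LDA, topic_to_name maps each topic to a reference name, and faces_per_doc gives the number of faces in each document.

    import numpy as np

    def assign_faces_text_alone(theta, topic_to_name, faces_per_doc):
        assignments = []
        for d, n_faces in enumerate(faces_per_doc):
            top_topics = np.argsort(theta[d])[::-1]          # topics ranked by probability
            # give each face in document d one of the most likely names, in order
            names = [topic_to_name[t] for t in top_topics[:n_faces]]
            assignments.append(names)
        return assignments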
Experiments-Clustering
Experiments-Clustering
Experiments-Classification Manually label the test images. Compare the model's result with the true label. Report accuracy and perplexity (lower perplexity means the model assigns higher probability to the correct images).
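As a reminder of the metric (standard definition, not specific to these slides): for N held-out test items x_1, …, x_N,

perplexity = exp( − (1/N) Σ_{i=1}^{N} log p(x_i) ),

so a lower perplexity means the model assigns higher probability to the correct assignments.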
Experiments-Classification
Conclusion People-LDA is a novel joint model of image and text. It performs better than the image-alone and text-alone approaches. It cannot associate names with people whose reference images are not present.