Dynamic Supervised Community-Topic Model

Slides:



Advertisements
Similar presentations
Sinead Williamson, Chong Wang, Katherine A. Heller, David M. Blei
Advertisements

Google News Personalization: Scalable Online Collaborative Filtering
A probabilistic model for retrospective news event detection
Topic models Source: Topic models, David Blei, MLSS 09.
An Adaptive Learning Method for Target Tracking across Multiple Cameras Kuan-Wen Chen, Chih-Chuan Lai, Yi-Ping Hung, Chu-Song Chen National Taiwan University.
Active Learning for Streaming Networked Data Zhilin Yang, Jie Tang, Yutao Zhang Computer Science Department, Tsinghua University.
Community Detection with Edge Content in Social Media Networks Paper presented by Konstantinos Giannakopoulos.
Fast Algorithms For Hierarchical Range Histogram Constructions
TI: An Efficient Indexing Mechanism for Real-Time Search on Tweets Chun Chen 1, Feng Li 2, Beng Chin Ooi 2, and Sai Wu 2 1 Zhejiang University, 2 National.
Finding Topic-sensitive Influential Twitterers Presenter 吴伟涛 TwitterRank:
Simultaneous Image Classification and Annotation Chong Wang, David Blei, Li Fei-Fei Computer Science Department Princeton University Published in CVPR.
Title: The Author-Topic Model for Authors and Documents
Text-Based Measures of Document Diversity Date : 2014/02/12 Source : KDD’13 Authors : Kevin Bache, David Newman, and Padhraic Smyth Advisor : Dr. Jia-Ling,
Statistical Topic Modeling part 1
Generative Topic Models for Community Analysis
Caimei Lu et al. (KDD 2010) Presented by Anson Liang.
Date:2011/06/08 吳昕澧 BOA: The Bayesian Optimization Algorithm.
Jing Gao 1, Feng Liang 1, Wei Fan 2, Chi Wang 1, Yizhou Sun 1, Jiawei Han 1 University of Illinois, IBM TJ Watson Debapriya Basu.
On Community Outliers and their Efficient Detection in Information Networks Jing Gao 1, Feng Liang 1, Wei Fan 2, Chi Wang 1, Yizhou Sun 1, Jiawei Han 1.
Visual Recognition Tutorial
Graph-based consensus clustering for class discovery from gene expression data Zhiwen Yum, Hau-San Wong and Hongqiang Wang Bioinformatics, 2007.
Modeling Scientific Impact with Topical Influence Regression James Foulds Padhraic Smyth Department of Computer Science University of California, Irvine.
Semantic History Embedding in Online Generative Topic Models Pu Wang (presenter) Authors: Loulwah AlSumait Daniel Barbará
1 A Discriminative Approach to Topic- Based Citation Recommendation Jie Tang and Jing Zhang Presented by Pei Li Knowledge Engineering Group, Dept. of Computer.
Prakash Chockalingam Clemson University Non-Rigid Multi-Modal Object Tracking Using Gaussian Mixture Models Committee Members Dr Stan Birchfield (chair)
EM and expected complete log-likelihood Mixture of Experts
Segmental Hidden Markov Models with Random Effects for Waveform Modeling Author: Seyoung Kim & Padhraic Smyth Presentor: Lu Ren.
Annealing Paths for the Evaluation of Topic Models James Foulds Padhraic Smyth Department of Computer Science University of California, Irvine* *James.
Bayesian Extension to the Language Model for Ad Hoc Information Retrieval Hugo Zaragoza, Djoerd Hiemstra, Michael Tipping Presented by Chen Yi-Ting.
Topic Modelling: Beyond Bag of Words By Hanna M. Wallach ICML 2006 Presented by Eric Wang, April 25 th 2008.
Partially Supervised Classification of Text Documents by Bing Liu, Philip Yu, and Xiaoli Li Presented by: Rick Knowles 7 April 2005.
Learning Geographical Preferences for Point-of-Interest Recommendation Author(s): Bin Liu Yanjie Fu, Zijun Yao, Hui Xiong [KDD-2013]
Uncovering Overlap Community Structure in Complex Networks using Particle Competition Fabricio A. Liang
Multiple Location Profiling for Users and Relationships from Social Network and Content Rui Li, Shengjie Wang, Kevin Chen-Chuan Chang University of Illinois.
Probabilistic Models for Discovering E-Communities Ding Zhou, Eren Manavoglu, Jia Li, C. Lee Giles, Hongyuan Zha The Pennsylvania State University WWW.
Introduction to LDA Jinyang Gao. Outline Bayesian Analysis Dirichlet Distribution Evolution of Topic Model Gibbs Sampling Intuition Analysis of Parameter.
Gaussian Processes Li An Li An
Lecture 6 Spring 2010 Dr. Jianjun Hu CSCE883 Machine Learning.
Generalized Model Selection For Unsupervised Learning in High Dimension Vaithyanathan and Dom IBM Almaden Research Center NIPS ’ 99.
Learning Sequence Motifs Using Expectation Maximization (EM) and Gibbs Sampling BMI/CS 776 Mark Craven
Recommend User to Group in Flickr Zhe Zhao
Finding document topics for improving topic segmentation Source: ACL2007 Authors: Olivier Ferret (18 route du Panorama, BP6) Reporter:Yong-Xiang Chen.
1 CS 430: Information Discovery Lecture 5 Ranking.
Relation Strength-Aware Clustering of Heterogeneous Information Networks with Incomplete Attributes ∗ Source: VLDB.
1 Chapter 8: Model Inference and Averaging Presented by Hui Fang.
Refined Online Citation Matching and Adaptive Canonical Metadata Construction CSE 598B Course Project Report Huajing Li.
A Nonparametric Method for Early Detection of Trending Topics Zhang Advisor: Prof. Aravind Srinivasan.
An unsupervised conditional random fields approach for clustering gene expression time series Chang-Tsun Li, Yinyin Yuan and Roland Wilson Bioinformatics,
Paper Presentation Social influence based clustering of heterogeneous information networks Qiwei Bao & Siqi Huang.
哈工大信息检索研究室 HITIR ’ s Update Summary at TAC2008 Extractive Content Selection Using Evolutionary Manifold-ranking and Spectral Clustering Reporter: Ph.d.
Meta-Path-Based Ranking with Pseudo Relevance Feedback on Heterogeneous Graph for Citation Recommendation By: Xiaozhong Liu, Yingying Yu, Chun Guo, Yizhou.
A Collapsed Variational Bayesian Inference Algorithm for Latent Dirichlet Allocation Yee W. Teh, David Newman and Max Welling Published on NIPS 2006 Discussion.
Topic Modeling for Short Texts with Auxiliary Word Embeddings
MINING DEEP KNOWLEDGE FROM SCIENTIFIC NETWORKS
Online Multiscale Dynamic Topic Models
DM-Group Meeting Liangzhe Chen, Nov
Clustering Evaluation The EM Algorithm
J. Zhu, A. Ahmed and E.P. Xing Carnegie Mellon University ICML 2009
An Efficient method to recommend research papers and highly influential authors. VIRAJITHA KARNATAPU.
Author: Kazunari Sugiyama, etc. (WWW2004)
Weakly Learning to Match Experts in Online Community
Bayesian Inference for Mixture Language Models
Stochastic Optimization Maximization for Latent Variable Models
RECOMMENDER SYSTEMS WITH SOCIAL REGULARIZATION
Topic models for corpora and for graphs
Learning Literature Search Models from Citation Behavior
Robust Full Bayesian Learning for Neural Networks
Michal Rosen-Zvi University of California, Irvine
Junghoo “John” Cho UCLA
GhostLink: Latent Network Inference for Influence-aware Recommendation
Presentation transcript:

Dynamic Supervised Community-Topic Model Daifeng Li 2011-5-10

Existing Study 1. H Zhang, B Qiu and etc (2007). LDA based Community Structure

Existing Study 2. Y Liu, A Niculescu-Mizil and etc(2009). Joint Model of Topic&Author Community (mainly used to make link prediction)

Existing Study 3. D Zhou & E Manavoglu(2006). Prob Model for discoverying e-communities. WWW2006

The Proposed Model Integrate Supervised mechanism into ACT model:

The Proposed Model The Purpose of New Model: 1. Detect Community from Topic-Level; 2. Make Dynamic analysis on Community and Topic evaluation; 3. Add Topic Label for citation relationship; 4. Improve the quality of detected Community(Topic similarity and conductance) 5. Rank the authors/users in a community in a more efficient way(not only judge by their activities)

The Proposed Model The Description of New Model: 1. Supervised Community-Topic Model (Gibbs EM) Sampling()//Sampling parts belong to E step {for each document d: for each author x: select word w, community cm, topic z, conference c according to prior parameters (multinomial distribution); end select supervised author y (whose paper the author x cite as reference) for each assignment according to prior parameters(Gaussian Distribution with mean value as and variance as ); }

The Proposed Model The Description of New Model: 1. Supervised Community-Topic Model (Gibbs EM) How to sample supervised author: a. Calculate Scores for each supervised author: Assume the author has written m papers, each paper has been cited for n times, the author’s position in each paper in r, then the score can be computed as: Score(author x)= b. Sampling: For each supervised author, it should be Score*P; where P means the Prob that author appears in current Community and Topic.

The Proposed Model The Description of New Model: 1. Supervised Community-Topic Model (Gibbs EM) Updating_Parameters() //Optimal Parameters and by M steps { Update } Where X is a D*CM matrix, each element is:

The Proposed Model The Description of New Model: 1. Supervised Community-Topic Model (Gibbs EM)

The Proposed Model The Description of New Model: for each iterations (1,000): //E Steps: for each document d: for each author x: Sample comm, topic, word, conference; end Sample supervised author for current author; End //M Steps: Update Parameters and

The proposed Model The Description of New Model:

Evaluation 1. Average H-Index and Average activities: For each Community(Compared the paper in WWW 2006); . 2. Compare Conductance between proposed algorithm and paper in WWW 2006; 3. Make prediction for computer committee; 4. Granger/ARMA analysis for Dynamic Community, Topic evolution.

Thanks!