TOPTRAC: Topical Trajectory Pattern Mining

Slides:

Advertisements

Similar presentations

Date: 2013/1/17 Author: Yang Liu, Ruihua Song, Yu Chen, Jian-Yun Nie and Ji-Rong Wen Source: SIGIR12 Advisor: Jia-ling Koh Speaker: Chen-Yu Huang Adaptive.

Advertisements

Mining User Similarity Based on Location History Yu Zheng, Quannan Li, Xing Xie Microsoft Research Asia.

Psychological Advertising: Exploring User Psychology for Click Prediction in Sponsored Search Date: 2014/03/25 Author: Taifeng Wang, Jiang Bian, Shusen.

Diversity Maximization Under Matroid Constraints Date : 2013/11/06 Source : KDD’13 Authors : Zeinab Abbassi, Vahab S. Mirrokni, Mayur Thakur Advisor :

Entity-Centric Topic-Oriented Opinion Summarization in Twitter Date : 2013/09/03 Author : Xinfan Meng, Furu Wei, Xiaohua, Liu, Ming Zhou, Sujian Li and.

Linking Named Entity in Tweets with Knowledge Base via User Interest Modeling Date : 2014/01/22 Author : Wei Shen, Jianyong Wang, Ping Luo, Min Wang Source.

CSE 221: Probabilistic Analysis of Computer Systems Topics covered: Statistical inference (Sec. )

Investigation of Web Query Refinement via Topic Analysis and Learning with Personalization Department of Systems Engineering & Engineering Management The.

CSE 221: Probabilistic Analysis of Computer Systems Topics covered: Statistical inference.

Multiscale Topic Tomography Ramesh Nallapati, William Cohen, Susan Ditmore, John Lafferty & Kin Ung (Johnson and Johnson Group)

LATENT DIRICHLET ALLOCATION. Outline Introduction Model Description Inference and Parameter Estimation Example Reference.

Scalable Text Mining with Sparse Generative Models

CSE 221: Probabilistic Analysis of Computer Systems Topics covered: Statistical inference.

1 Context-Aware Search Personalization with Concept Preference CIKM’11 Advisor ： Jia Ling, Koh Speaker ： SHENG HONG, CHUNG.

Topic Models in Text Processing IR Group Meeting Presented by Qiaozhu Mei.

Beyond Co-occurrence: Discovering and Visualizing Tag Relationships from Geo-spatial and Temporal Similarities Date : 2012/8/6 Resource : WSDM’12 Advisor.

CIKM’09 Date:2010/8/24 Advisor: Dr. Koh, Jia-Ling Speaker: Lin, Yi-Jhen 1.

Topic Modelling: Beyond Bag of Words By Hanna M. Wallach ICML 2006 Presented by Eric Wang, April 25 th 2008.

Learning Geographical Preferences for Point-of-Interest Recommendation Author(s): Bin Liu Yanjie Fu, Zijun Yao, Hui Xiong [KDD-2013]

Feedback Effects between Similarity and Social Influence in Online Communities David Crandall, Dan Cosley, Daniel Huttenlocher, Jon Kleinberg, Siddharth.

Date : 2014/01/14 Author : Thanh-Son Nguyen, Hady W. Lauw, Panayiotis Tsaparas Source : CIKM’13 Advisor : Jia-ling Koh Speaker : Shao-Chun Peng.

Integrating Topics and Syntax -Thomas L

Using Inactivity to Detect Unusual behavior Presenter : Siang Wang Advisor : Dr. Yen - Ting Chen Date : Motion and video Computing, WMVC.

Probabilistic Models of Novel Document Rankings for Faceted Topic Retrieval Ben Cartrette and Praveen Chandar Dept. of Computer and Information Science.

BioSnowball: Automated Population of Wikis (KDD ‘10) Advisor: Dr. Koh, Jia-Ling Speaker: Lin, Yi-Jhen Date: 2010/11/30 1.

Latent Dirichlet Allocation D. Blei, A. Ng, and M. Jordan. Journal of Machine Learning Research, 3: , January Jonathan Huang

Date : 2013/03/18 Author : Jeffrey Pound, Alexander K. Hudek, Ihab F. Ilyas, Grant Weddell Source : CIKM’12 Speaker : Er-Gang Liu Advisor : Prof. Jia-Ling.

Probabilistic Models for Discovering E-Communities Ding Zhou, Eren Manavoglu, Jia Li, C. Lee Giles, Hongyuan Zha The Pennsylvania State University WWW.

Jiafeng Guo(ICT) Xueqi Cheng(ICT) Hua-Wei Shen(ICT) Gu Xu (MSRA) Speaker: Rui-Rui Li Supervisor: Prof. Ben Kao.

1 A Web Search Engine-Based Approach to Measure Semantic Similarity between Words Presenter: Guan-Yu Chen IEEE Trans. on Knowledge & Data Engineering,

Intelligent Database Systems Lab Advisor ： Dr. Hsu Graduate ： Chien-Shing Chen Author ： Juan D.Velasquez Richard Weber Hiroshi Yasuda 國立雲林科技大學 National.

Effective Automatic Image Annotation Via A Coherent Language Model and Active Learning Rong Jin, Joyce Y. Chai Michigan State University Luo Si Carnegie.

Latent Dirichlet Allocation

A Classification-based Approach to Question Answering in Discussion Boards Liangjie Hong, Brian D. Davison Lehigh University (SIGIR ’ 09) Speaker: Cho,

 Present by 陳群元.  Introduction  Previous work  Predicting motion patterns  Spatio-temporal transition distribution  Discerning pedestrians  Experimental.

Dynamic Multi-Faceted Topic Discovery in Twitter Date : 2013/11/27 Source : CIKM’13 Advisor : Dr.Jia-ling, Koh Speaker : Wei, Chang 1.

1 Adaptive Subjective Triggers for Opinionated Document Retrieval (WSDM 09’) Kazuhiro Seki, Kuniaki Uehara Date: 11/02/09 Speaker: Hsu, Yu-Wen Advisor:

Discovering Evolutionary Theme Patterns from Text - An Exploration of Temporal Text Mining Qiaozhu Mei and ChengXiang Zhai Department of Computer Science.

PERSONALIZED DIVERSIFICATION OF SEARCH RESULTS Date: 2013/04/15 Author: David Vallet, Pablo Castells Source: SIGIR’12 Advisor: Dr.Jia-ling, Koh Speaker:

An Energy-Efficient Approach for Real-Time Tracking of Moving Objects in Multi-Level Sensor Networks Vincent S. Tseng, Eric H. C. Lu, & Kawuu W. Lin Institute.

Discovering Evolutionary Theme Patterns from Text -An exploration of Temporal Text Mining KDD’05, August 21–24, 2005, Chicago, Illinois, USA. Qiaozhu Mei.

Text-classification using Latent Dirichlet Allocation - intro graphical model Lei Li

黃福銘 (Angus F.M. Huang) ANTS Lab, IIS, Academia Sinica Exploring Spatial-Temporal Trajectory Model for Location.

Inferring User Interest Familiarity and Topic Similarity with Social Neighbors in Facebook INSTRUCTOR: DONGCHUL KIM ANUSHA BOOTHPUR

A Collapsed Variational Bayesian Inference Algorithm for Latent Dirichlet Allocation Yee W. Teh, David Newman and Max Welling Published on NIPS 2006 Discussion.

Hidden Markov Models BMI/CS 576

Topic Modeling for Short Texts with Auxiliary Word Embeddings

Extracting Mobile Behavioral Patterns with the Distant N-Gram Topic Model Lingzi Hong Feb 10th.

Customized of Social Media Contents using Focused Topic Hierarchy

Click Through Rate Prediction for Local Search Results

Where Did You Go: Personalized Annotation of Mobility Records

Online Multiscale Dynamic Topic Models

Open question answering over curated and extracted knowledge bases

On the Generative Discovery of Structured Medical Knowledge

J. Zhu, A. Ahmed and E.P. Xing Carnegie Mellon University ICML 2009

Speaker: Jim-an tsai advisor: professor jia-lin koh

Topic Modeling Nick Jordan.

Stochastic Optimization Maximization for Latent Variable Models

Speaker: Jim-An Tsai Advisor: Professor Jia-ling Koh

Michal Rosen-Zvi University of California, Irvine

Sourse: Www 2017 Advisor: Jia-Ling Koh Speaker: Hsiu-Yi,Chu

Latent Dirichlet Allocation

CS246: Latent Dirichlet Analysis

Topic Models in Text Processing

DBRef:Discussion on New Features

Date: 2012/11/15 Author: Jin Young Kim, Kevyn Collins-Thompson,

Wiki3C: Exploiting Wikipedia for Context-aware Concept Categorization

Discovering Important Nodes through Graph Entropy

Heterogeneous Graph Attention Network

Attention Is All You Need

Presentation transcript:

TOPTRAC: Topical Trajectory Pattern Mining Source: KDD 2015 Advisor: Jia-Ling Koh Speaker: Hsiu-Yi,Chu Date: 2018/1/22

Outline Introduction Method Experient conclusion

Introduction

Introduction Goal Topical trajectory mining problem: Given a collection of geo-tagged message trajectories, it’s to find topical transition pattern and the top-k transition snippets which best represent each transition pattern

Introduction Transition pattern Transition snippet

Introduction Definition Trajectory(st) geo-tagged message (mt,i) Geo-tag Gt,i : 2-dim vector(Gt,i,x,Gt,i,y) Bag-of-word wt,i : N words{wt,i,1,…, wt,i,n}

Introduction Definition Latent semantic region: a geographical location where messages are posted with the same topic preference Topical transition pattern: a movement from one semantic region to another frequently

Outline Introduction Method Experience conclusion

Method Generative Model Assume there are M latent semantic regions K hidden topics in the collection of geo-tagged messages How to generate each sequence st = (mt,1, mt,2 , … , mt,n )

Method Generative process Ex:Θ1=(topic1,…topic k ) Ex:Φk=(word1,…,word v) 0.3 0.2 0.6 2 3

Method λt : Bernoulli distribution(0~1) St,i = {0,1}: mt,1 mt,2 λt : Bernoulli distribution(0~1) St,i = {0,1}: Whether mt,i is in the local context

Method Case1: St,i = 1 Case2: i =1and St,i = 1 or mt,1 mt,2 Case1: St,i = 1 Select Rt,i = Uniform(1/M) Generate Gt,i = Uniform(f0) Case2: i =1and St,i = 1 or i >= 2 and St,i = 1 and St,i-1 = 0 Select Rt,I = Categorical(δ0) Generate Gt,I = f(Gt,I)

Method Case3: else Select Rt,I = Categorical(δr(t,i-1),z(t,i-1)) mt,1 mt,3 Method Case3: else Select Rt,I = Categorical(δr(t,i-1),z(t,i-1)) Generate Gt,I = f(Gt,I)

Method Select Topic Select a message Zt,i = Categorical(θRt,i) wt,I = Multinomial(ΦZt,i)

Method Likelihood

Method Variational EM Algorithm Maximum likelihood estimation θR, Φk, λt St,i, Rt,i, Zt,I μr, Σr

Method Finding the Most Likely Sequence Notations: : maximum probability to generate the subsequence when St,i=0 : : maximum probability to generate the subsequence when St,i=1

Method Compute : Compute : case1: St,i-1 = 0 ; case2 : St,i-1 = 1

Method Finding Frequent Transition Patterns st’ = {(st,1, rt,1, zt,1),…,(st,n, rt,n, zt,n)} Transition Patterns = {( r1, z1)(r2, z2)} Start with (1, r1, z1) and ends with (1, r2, z2) τ : minimum support

Method Example Top-k transition snippets k largest probabilities of s1’={(0,1,1)(1,1,2)(1,2,1)}, s2’={(1,1,2)(0,2,1)(1,2,1)} with τ = 2 → {(1,2)(2,1)} is a transition pattern Top-k transition snippets k largest probabilities of

Outline Introduction Method Experience conclusion

Experience Data sets NYC SANF 9070 trajectories, 266808 geo-tagged messages M = 30, K = 30, τ = 100 SANF 809 trajectories,19664 geo-tagged messages M = 20, K = 20, τ = 10

Experience Baseline LGTA NAÏVE Run the inference algorithm and find frequent trajectory patterns similar in page15,16 NAÏVE First groups messages using EM clustering Cluster the messages in each group with LDA

Experience

Experience

Experience

Experience

Outline Introduction Method Experience conclusion

Conclusion Propose a trajectory pattern mining algorithm, called TOPTRAC, using probabilistic model to capture the spatial and topical patterns of users. Developed an efficient inference algorithm for our model and also devised algorithms to find frequent transition patterns as well as the best representative snippets of each pattern.