Diversified Trajectory Pattern Ranking in Geo-Tagged Social Media

Slides:



Advertisements
Similar presentations
15th CTI Workshop, July 26, Smart Itinerary Recommendation based on User-Generated GPS Trajectories Hyoseok Yoon 1, Y. Zheng 2, X. Xie 2 and W.
Advertisements

Web Mining.
Exploring Traversal Strategy for Web Forum Crawling Yida Wang, Jiang-Ming Yang, Wei Lai, Rui Cai, Lei Zhang and Wei-Ying Ma Chinese Academy of Sciences.
Incremental Clustering for Trajectories
Vincent W. Zheng, Yu Zheng, Xing Xie, Qiang Yang Hong Kong University of Science and Technology Microsoft Research Asia This work was done when Vincent.
Location-Based Social Networks Yu Zheng and Xing Xie Microsoft Research Asia Chapter 8 and 9 of the book Computing with Spatial Trajectories.
Mining User Similarity Based on Location History Yu Zheng, Quannan Li, Xing Xie Microsoft Research Asia.
DQR : A Probabilistic Approach to Diversified Query recommendation Date: 2013/05/20 Author: Ruirui Li, Ben Kao, Bin Bi, Reynold Cheng, Eric Lo Source:
A Unified Framework for Context Assisted Face Clustering
Mining Compressed Frequent- Pattern Sets Dong Xin, Jiawei Han, Xifeng Yan, Hong Cheng Department of Computer Science University of Illinois at Urbana-Champaign.
A Phrase Mining Framework for Recursive Construction of a Topical Hierarchy Date : 2014/04/15 Source : KDD’13 Authors : Chi Wang, Marina Danilevsky, Nihit.
WWW 2014 Seoul, April 8 th SNOW 2014 Data Challenge Two-level message clustering for topic detection in Twitter Georgios Petkos, Symeon Papadopoulos, Yiannis.
Dong Liu Xian-Sheng Hua Linjun Yang Meng Weng Hong-Jian Zhang.
Clustering by Passing Messages Between Data Points Brendan J. Frey and Delbert Dueck Science, 2007.
Retrieving k-Nearest Neighboring Trajectories by a Set of Point Locations Lu-An Tang, Yu Zheng, Xing Xie, Jing Yuan, Xiao Yu, Jiawei Han University of.
Software Quality Ranking: Bringing Order to Software Modules in Testing Fei Xing Michael R. Lyu Ping Guo.
Context-aware Query Suggestion by Mining Click-through and Session Data Authors: H. Cao et.al KDD 08 Presented by Shize Su 1.
Integrating Bayesian Networks and Simpson’s Paradox in Data Mining Alex Freitas University of Kent Ken McGarry University of Sunderland.
Recsplorer: Recommendation Algorithms Based on Precedence Mining Aditya Parameswaran Stanford University (Joint work with G. Koutrika, B. Bercovitz & H.
Chen Cheng1, Haiqin Yang1, Irwin King1,2 and Michael R. Lyu1
Kyle Heath, Natasha Gelfand, Maks Ovsjanikov, Mridul Aanjaneya, Leo Guibas Image Webs Computing and Exploiting Connectivity in Image Collections.
Vincent W. Zheng †, Bin Cao †, Yu Zheng ‡, Xing Xie ‡, Qiang Yang † † Hong Kong University of Science and Technology ‡ Microsoft Research Asia This work.
Mining Interesting Locations and Travel Sequences From GPS Trajectories Yu Zheng and Xing Xie Microsoft Research Asia March 16, 2009.
Large-Scale Cost-sensitive Online Social Network Profile Linkage.
Search Result Diversification by M. Drosou and E. Pitoura Presenter: Bilge Koroglu June 14, 2011.
Temporal Event Map Construction For Event Search Qing Li Department of Computer Science City University of Hong Kong.
Friends and Locations Recommendation with the use of LBSN
Personalization in Local Search Personalization of Content Ranking in the Context of Local Search Philip O’Brien, Xiao Luo, Tony Abou-Assaleh, Weizheng.
Tag Clouds Revisited Date : 2011/12/12 Source : CIKM’11 Speaker : I- Chih Chiu Advisor : Dr. Koh. Jia-ling 1.
A Comparative Study of Search Result Diversification Methods Wei Zheng and Hui Fang University of Delaware, Newark DE 19716, USA
Mining Interesting Locations and Travel Sequences from GPS Trajectories IDB & IDS Lab. Seminar Summer 2009 강 민 석강 민 석 July 23 rd,
WALKING IN FACEBOOK: A CASE STUDY OF UNBIASED SAMPLING OF OSNS junction.
Recommendation system MOPSI project KAROL WAGA
Ruirui Li, Ben Kao, Bin Bi, Reynold Cheng, Eric Lo Speaker: Ruirui Li 1 The University of Hong Kong.
Tag Ranking Present by Jie Xiao Dept. of Computer Science Univ. of Texas at San Antonio.
Presented by, Lokesh Chikkakempanna Authoritative Sources in a Hyperlinked environment.
Intent Subtopic Mining for Web Search Diversification Aymeric Damien, Min Zhang, Yiqun Liu, Shaoping Ma State Key Laboratory of Intelligent Technology.
Pattern-Growth Methods for Sequential Pattern Mining Iris Zhang
Friends and Locations Recommendation with the use of LBSN By EKUNDAYO OLUFEMI ADEOLA
Shape-based Similarity Query for Trajectory of Mobile Object NTT Communication Science Laboratories, NTT Corporation, JAPAN. Yutaka Yanagisawa Jun-ichi.
Improving Web Search Results Using Affinity Graph Benyu Zhang, Hua Li, Yi Liu, Lei Ji, Wensi Xi, Weiguo Fan, Zheng Chen, Wei-Ying Ma Microsoft Research.
Flickr Tag Recommendation based on Collective Knowledge BÖrkur SigurbjÖnsson, Roelof van Zwol Yahoo! Research WWW Summarized and presented.
Finding Experts Using Social Network Analysis 2007 IEEE/WIC/ACM International Conference on Web Intelligence Yupeng Fu, Rongjing Xiang, Yong Wang, Min.
Automatic Video Tagging using Content Redundancy Stefan Siersdorfer 1, Jose San Pedro 2, Mark Sanderson 2 1 L3S Research Center, Germany 2 University of.
+ User-induced Links in Collaborative Tagging Systems Ching-man Au Yeung, Nicholas Gibbins, Nigel Shadbolt CIKM’09 Speaker: Nonhlanhla Shongwe 18 January.
Intelligent DataBase System Lab, NCKU, Taiwan Josh Jia-Ching Ying 1, Wang-Chien Lee 2, Tz-Chiao Weng 1 and Vincent S. Tseng 1 1 Department of Computer.
Mining Trajectory Profiles for Discovering User Communities Speaker : Chih-Wen Chang National Chiao Tung University, Taiwan Chih-Chieh Hung,
Speaker : Yu-Hui Chen Authors : Dinuka A. Soysa, Denis Guangyin Chen, Oscar C. Au, and Amine Bermak From : 2013 IEEE Symposium on Computational Intelligence.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology 1 Mining Advisor-Advisee Relationships from Research Publication.
An Energy-Efficient Approach for Real-Time Tracking of Moving Objects in Multi-Level Sensor Networks Vincent S. Tseng, Eric H. C. Lu, & Kawuu W. Lin Institute.
PrefixSpan: Mining Sequential Patterns Efficiently by Prefix-Projected Pattern Growth Jiawei Han, Jian Pei, Helen Pinto, Behzad Mortazavi-Asl, Qiming Chen,
黃福銘 (Angus F.M. Huang) ANTS Lab, IIS, Academia Sinica Exploring Spatial-Temporal Trajectory Model for Location.
Location-based Social Networks 6/11/20161 CENG 770.
Big data Analytics for Tourism Destination management
A Collaborative Quality Ranking Framework for Cloud Components
Mining User Similarity from Semantic Trajectories
Presented by: Mi Tian, Deepan Sanghavi, Dhaval Dholakia
Exploring Social Tagging Graph for Web Object Classification
Personalized Social Image Recommendation
Summary Presented by : Aishwarya Deep Shukla
Fast Preprocessing for Robust Face Sketch Synthesis
Frontiers of Computer Science, DOI: /s y
Jiawei Han Department of Computer Science
Location Recommendation — for Out-of-Town Users in Location-Based Social Network Yina Meng.
MEgo2Vec: Embedding Matched Ego Networks for User Alignment Across Social Networks Jing Zhang+, Bo Chen+, Xianming Wang+, Fengmei Jin+, Hong Chen+, Cuiping.
Jiawei Han Department of Computer Science
Mashup Service Recommendation based on User Interest and Service Network Buqing Cao ICWS2013, IJWSR.
“Clustering by Passing Messages Between Data Points”
MAPO: Mining and Recommending API Usage Patterns
Mole: Motion Leaks through Smartwatch Sensors
Presentation transcript:

Diversified Trajectory Pattern Ranking in Geo-Tagged Social Media Zhijun Yin1, Liangliang Cao1, Jiawei Han1, Jiebo Luo2, Thomas Huang1 University of Illinois at Urbana-Champaign1 Kodak Research Laboratories2

Outline Motivation Problem Formulation Framework Evaluation Conclusion

Motivation Social media websites such as Flickr, Facebook host overwhelming amounts of photos. In such a media sharing community, images are contributed, tagged, and commented by users all over the world. Extra information can be incorporated within social media, such as geographical information captured by GPS devices.

Motivation (Cont.)

Motivation (Cont.)

Ranking Diversification Motivation (Cont.) Explore the common wisdom in photo sharing community Discover trajectory patterns interesting to two kinds of users Some users are interested in the most important trajectory patterns. Some users are interested in exploring a new place in diverse way. Ranking Diversification

Problem Formulation Given a collection of geo-tagged photos along with users, locations and timestamps, how to rank the mined trajectory patterns with diversification into consideration.

Framework (1) Extracting trajectory patterns from the geo-tagged photo collection. (2) Ranking the trajectory patterns by estimating their importance according to user, location and trajectory relations. (3) Diversifying the ranking result to identify the representative trajectory patterns from all the candidates.

Trajectory Pattern Mining Since the GPS coordinates of photos are at a very fine granularity, we need to detect locations before extracting trajectory patterns. With the detected locations, we can generate the trajectories for each user according to his visiting order of locations during the same day. Mine frequent trajectory patterns using sequential pattern mining algorithm.

Location Detection Mean-shift algorithm (27974 photos in London)

Location Detection (Cont.) Top locations in London and their descriptions. The number in the parentheses is the number of users visiting the place.

Sequential Pattern Mining PrefixSpan[1] Example (minimum support = 2) We can get 3 frequent sequential patterns: londoneye -> bigben londoneye -> bigben -> trafalgarsquare londoneye -> tatemodern

Sequential Pattern Mining (Cont.) Top frequent trajectories in London

Sequential Pattern Mining (Cont.) There are too many trajectory patterns and it is difficult for the users to browse all the candidates. Ranking by frequency? The top ten trajectory patterns ranked by frequency are of length 2 and not informative. Ranking

Trajectory Pattern Ranking Relationship among user, location and trajectory

Trajectory Pattern Ranking (Cont.) A trajectory pattern is important if many important users take it and it contains important locations. A user is important if the user takes photos at important locations and visits the important trajectory patterns. An location is important if it occurs in one or more important trajectory patterns and many important users take photos at the location.

Trajectory Pattern Ranking (Cont.) PT is the eigen vector for MTM for the largest eigen value, where M = MTU MULMLT . Algorithm 1 is a normalized power iteration method to detect the eigen vector of MTM for the largest eigen value if the intial PT is not orthogonal to it.

Trajectory Pattern Ranking Top ranked trajectory patterns in London.

Trajectory Pattern Ranking (Cont.) Top ranked locations in London with normalized PL scores and frequency.

Trajectory Pattern Diversification The result in top ranked trajectories illustrates the popular routes together with important sites such as londoneye, bigben, and tatemodern. However, it is highly biased in only a few regions. Trajectory 1 (londoneye -> bigben -> downingstreet -> horseguards -> trafalgarsquare) Trajectory 5 (westminster -> bigben -> downingstreet -> horseguards ->trafalgarsquare) Diversification

Trajectory Pattern Diversification (Cont.) Similar trajectory patterns need to be aggregated together. Good exemplars of trajectory patterns need to be selected. Those trajectories patterns ranked highly in our ranking algorithm should get higher priority to be exemplars.

Trajectory Pattern Diversification (Cont.) We define the similarity between two trajectories based on longest common subsequence (LCSS). The similarity measure LCSS(i, j) can be viewed as how well trajectory i represents trajectory j. Suppose trajectory i is represented by an exemplar trajectory r(i), we can see that trajectory i becomes an exemplar if r(i) = i. The optimal set of exemplars corresponds to the ones for which the sum of similarities of each point to its exemplar is maximized.

Trajectory Pattern Diversification (Cont.) There are several ways of searching for the optimal exemplars such as vertex substitution heuristic p-median search and affinity propagation. Frey and Dueck's affinity propagation[2]: it considers all data points as potential exemplars and iteratively exchanges messages between data points until it finds a good solution with a set of exemplars. To incorporate the information of ranking results, we can give higher ranked trajectories larger self-similarity scores in message passing.

Trajectory Pattern Diversification (Cont.) Exemplars examples:

Trajectory Pattern Diversification (Cont.) Trajectory pattern diversification results in London.

Trajectory Pattern Diversification (Cont.)

Evaluation Data Sets We crawled images with GPS records using Flickr API (http://www.flickr.com/services/api/)

Evaluation (Cont.) Compared methods FreqRank: Rank trajectory patterns by sequential pattern frequency ClassicRank: The method used in [3] to mine classic travel sequences. The classical score of a sequence is the integration of the sum of hub scores of the users, the authority scores of the locations. TrajRank: Trajectory pattern ranking TrajDiv: Trajectory pattern diversification

Evaluation (Cont.) Measures NDCG (normalized discounted cumulative gain) highly interesting (2), interesting (1), not interesting (0) Location Coverage The number of covered locations in the top results. Trajectory Coverage The summation of the edit distance of each trajectory pattern in the dataset to the closest one in the top result. The score is normalized by the summation of the edit distance of each trajectory pattern to the closest one in the dataset.

Evaluation (Cont.) NDCG

Evaluation (Cont.) Location Coverage Trajectory Coverage

Evaluation (Cont.) London londoneye -> bigben -> downingstreet -> horseguards -> trafalgarsquare

Evaluation (Cont.) Paris eiffeltower -> louvremuseum -> notredamedeparis

Evaluation (Cont.) Location recommendation based on current trajectory in London

Conclusion We studied the problem of trajectory pattern ranking and diversification based on geo-tagged social media. We extracted trajectory patterns from geo-tagged photos using sequential pattern mining and proposed a ranking strategy that considers the relationships among user, location and trajectory. To diversify the ranking results, we used an exemplar-based algorithm to discover the representative trajectory patterns. We tested our methods on the photos of 12 different cities from Flickr and demonstrated their effectiveness.

Thanks!

Reference [1] J. Pei, J. Han, B. Mortazavi-Asl, J. Wang, H. Pinto, Q. Chen, U. Dayal, and M. Hsu. Mining sequential patterns by pattern-growth: The prexspan approach. IEEE Trans. Knowl. Data Eng., 16(11):1424-1440, 2004. [2] B. J. Frey and D. Dueck. Clustering by passing messages between data points. Science, 315:972-976, 2007. [3] Y. Zheng, L. Zhang, X. Xie, and W.-Y. Ma. Mining interesting locations and travel sequences from gps trajectories. In WWW, pages 791-800, 2009.