Finding Wormholes with Flickr Geotags Maarten Clements Marcel Reinders Arjen de Vries Pavel Serdyukov December 3 rd, 2009 GIS.

Slides:



Advertisements
Similar presentations
Pseudo-Relevance Feedback For Multimedia Retrieval By Rong Yan, Alexander G. and Rong Jin Mwangi S. Kariuki
Advertisements

Mining User Similarity Based on Location History Yu Zheng, Quannan Li, Xing Xie Microsoft Research Asia.
Introduction Distance-based Adaptable Similarity Search
Suleyman Cetintas 1, Monica Rogati 2, Luo Si 1, Yi Fang 1 Identifying Similar People in Professional Social Networks with Discriminative Probabilistic.
Aggregating local image descriptors into compact codes
Learning Trajectory Patterns by Clustering: Comparative Evaluation Group D.
Empirical Evaluation of Dissimilarity Measures for Color and Texture
Exercising these ideas  You have a description of each item in a small collection. (30 web sites)  Assume we are looking for information about boxers,
Information Extraction from Multimedia Content on the Social Web Stefan Siersdorfer L3S Research Centre, Hannover, Germany.
Stephan Gammeter, Lukas Bossard, Till Quack, Luc Van Gool.
1 Entity Ranking Using Wikipedia as a Pivot (CIKM 10’) Rianne Kaptein, Pavel Serdyukov, Arjen de Vries, Jaap Kamps 2010/12/14 Yu-wen,Hsu.
Evaluating Search Engine
Flickr Tags Network Mustafa Kilavuz. Tags A tag is a keyword Search, spam detection, reputation systems, personal organization and metadata.
Object retrieval with large vocabularies and fast spatial matching
Efficient Processing of Top-k Spatial Keyword Queries João B. Rocha-Junior, Orestis Gkorgkas, Simon Jonassen, and Kjetil Nørvåg 1 SSTD 2011.
Retrieval Evaluation. Brief Review Evaluation of implementations in computer science often is in terms of time and space complexity. With large document.
Generating Summaries and Visualization for Large Collections of Geo-referenced Photographs Alexander Jaffe*, Mor Naaman*, Tamir Tassa †, Marc Davis $ *Yahoo!
1 An Empirical Study on Large-Scale Content-Based Image Retrieval Group Meeting Presented by Wyman
Retrieval Evaluation. Introduction Evaluation of implementations in computer science often is in terms of time and space complexity. With large document.
Using Relevance Feedback in Multimedia Databases
ICME 2004 Tzvetanka I. Ianeva Arjen P. de Vries Thijs Westerveld A Dynamic Probabilistic Multimedia Retrieval Model.
A structured learning framework for content- based image indexing and visual Query (Joo-Hwee, Jesse S. Jin) Presentation By: Salman Ahmad (270279)
Mobile Photos April 17, Auto Extraction of Flickr Tags Unstructured text labels Extract structured knowledge Place and event semantics Scale-structure.
Automatically obtain a description for a larger cluster of relevant documents Identify terms related to query terms  Synonyms, stemming variations, terms.
EVENT IDENTIFICATION IN SOCIAL MEDIA Hila Becker, Luis Gravano Mor Naaman Columbia University Rutgers University.
Presented by: Michal Nir, Saar Gross Supervisors: Nadav Golbandi, Oren Somekh Computer Science Department Industrial Project (234313) Tuesday, January.
Personalization in Local Search Personalization of Content Ranking in the Context of Local Search Philip O’Brien, Xiao Luo, Tony Abou-Assaleh, Weizheng.
1 Context-Aware Search Personalization with Concept Preference CIKM’11 Advisor : Jia Ling, Koh Speaker : SHENG HONG, CHUNG.
Philosophy of IR Evaluation Ellen Voorhees. NIST Evaluation: How well does system meet information need? System evaluation: how good are document rankings?
MASTER THESIS num. 802 ANALYSIS OF ALGORITHMS FOR DETERMINING TRUST AMONG FRIENDS ON SOCIAL NETWORKS Mirjam Šitum Ao. Univ. Prof. Dr. Dieter Merkl Univ.
Beyond Co-occurrence: Discovering and Visualizing Tag Relationships from Geo-spatial and Temporal Similarities Date : 2012/8/6 Resource : WSDM’12 Advisor.
Linking Wikipedia to the Web Antonio Flores Bernal Department of Computer Sciencies San Pablo Catholic University 2010.
Recommendation system MOPSI project KAROL WAGA
Query Routing in Peer-to-Peer Web Search Engine Speaker: Pavel Serdyukov Supervisors: Gerhard Weikum Christian Zimmer Matthias Bender International Max.
Glasgow 02/02/04 NN k networks for content-based image retrieval Daniel Heesch.
INTELLIGENT ORACLE CEMNET, SCE, NTU Speaker: Zeng Zinan
April 14, 2003Hang Cui, Ji-Rong Wen and Tat- Seng Chua 1 Hierarchical Indexing and Flexible Element Retrieval for Structured Document Hang Cui School of.
AP Human Geography Central Place Theory.
Learning Geographical Preferences for Point-of-Interest Recommendation Author(s): Bin Liu Yanjie Fu, Zijun Yao, Hui Xiong [KDD-2013]
Distributed Information Retrieval Server Ranking for Distributed Text Retrieval Systems on the Internet B. Yuwono and D. Lee Siemens TREC-4 Report: Further.
Flickr the framework of Flickr. Observe them  How many photos does each user offer?  How many tags does each photo have?  The tag hot-list  How many.
Chapter 8 Evaluating Search Engine. Evaluation n Evaluation is key to building effective and efficient search engines  Measurement usually carried out.
1 A Compact Feature Representation and Image Indexing in Content- Based Image Retrieval A presentation by Gita Das PhD Candidate 29 Nov 2005 Supervisor:
Gang WangDerek HoiemDavid Forsyth. INTRODUCTION APROACH (implement detail) EXPERIMENTS CONCLUSION.
Flickr Tag Recommendation based on Collective Knowledge BÖrkur SigurbjÖnsson, Roelof van Zwol Yahoo! Research WWW Summarized and presented.
Semi-Automatic Image Annotation Liu Wenyin, Susan Dumais, Yanfeng Sun, HongJiang Zhang, Mary Czerwinski and Brent Field Microsoft Research.
Content-Based Image Retrieval (CBIR) By: Victor Makarenkov Michael Marcovich Noam Shemesh.
1 Computational Vision CSCI 363, Fall 2012 Lecture 6 Edge Detection.
Personal Tag Semantic Relation Yi-Ching Huang 2008/02/27 Yi-Ching Huang 2008/02/27.
INAOE at GeoCLEF 2008: A Ranking Approach based on Sample Documents Esaú Villatoro-Tello Manuel Montes-y-Gómez Luis Villaseñor-Pineda Language Technologies.
Michael Bendersky, W. Bruce Croft Dept. of Computer Science Univ. of Massachusetts Amherst Amherst, MA SIGIR
Duc-Tien Dang-Nguyen, Giulia Boato, Alessandro Moschitti, Francesco G.B. De Natale Department to Information and Computer Science –University of Trento.
Ken goldberg, gail de kosnik, kimiko ryokai (+ students) uc berkeley Opinion Space.
Flickr Tag Recommendation based on Collective Knowledge Hyunwoo Kim SNU IDB Lab. August 27, 2008 Borkur Sigurbjornsson, Roelof van Zwol Yahoo! Research.
Location-based Social Networks 6/11/20161 CENG 770.
Geotagging Social Media Content with a Refined Language Modelling Approach Georgios Kordopatis-Zilos, Symeon Papadopoulos, and Yiannis Kompatsiaris Centre.
Short Text Similarity with Word Embedding Date: 2016/03/28 Author: Tom Kenter, Maarten de Rijke Source: CIKM’15 Advisor: Jia-Ling Koh Speaker: Chih-Hsuan.
Privacy Vulnerability of Published Anonymous Mobility Traces Chris Y. T. Ma, David K. Y. Yau, Nung Kwan Yip (Purdue University) Nageswara S. V. Rao (Oak.
Big data Analytics for Tourism Destination management
Diversified Trajectory Pattern Ranking in Geo-Tagged Social Media
Improving Search Relevance for Short Queries in Community Question Answering Date: 2014/09/25 Author : Haocheng Wu, Wei Wu, Ming Zhou, Enhong Chen, Lei.
6 ~ GIR.
Personalized Social Image Recommendation
Summary Presented by : Aishwarya Deep Shukla
Martin Rajman, Martin Vesely
Location Recommendation — for Out-of-Town Users in Location-Based Social Network Yina Meng.
AP Human Geography Central Place Theory.
by Khaled Nasr, Pooja Viswanathan, and Andreas Nieder
Ranking using Multiple Document Types in Desktop Search
A Neural Passage Model for Ad-hoc Document Retrieval
Presentation transcript:

Finding Wormholes with Flickr Geotags Maarten Clements Marcel Reinders Arjen de Vries Pavel Serdyukov December 3 rd, 2009 GIS

03/12/20092 Maarten Clements PhD: personalized retrieval in Social Media Faculty of EEMCS – ICT group. Supervisors º Marcel Reinders – Prof. Bioinformatics (and more) º Arjen de Vries – CWI, Prof. MM Dataspaces

03/12/20093 Maarten Clements Location prediction Predict relevant locations º Location  Location º User  Location Why? Flickr: MarsWFlickr: msokal 1 2 ?

03/12/20094 Maarten Clements Location prediction

03/12/20095 Maarten Clements Flickr Foto sharing website º Billions of photos º Active community: º Tags, Geotags, Favorites, Comments… M 91.4M Geotags in flickr

03/12/20096 Maarten Clements Flickr Using Flickr API to collect data: º Strategy to find people who geotag: First collected top cities in 'New York, NY, United States' 2. 'London, England, United Kingdom' 3. 'San Francisco, California, United States' 4. 'Paris, Ile-de-France, France' 5. … Lo Verdes, Canary Islands, Spain

03/12/20097 Maarten Clements Flickr Repeat: º Select a city based on full distribution º Get a photo at this location (geotagged) º Select the user who made the photo º Get all this users photos City

03/12/20098 Maarten Clements Flickr Users:36,264 Photos: 52,425,279 Geo Tags: 22,710,496

03/12/20099 Maarten Clements Flickr Tags Titles Time stamps Social network Descriptions Groups

03/12/ Maarten Clements Flickr

03/12/ Maarten Clements Wormholes Places that are similar but not necessarily spatially close. Use user travel patterns to detect these places Assumptions º Users have a certain travel preference º Users make photos at places they like

03/12/ Maarten Clements Wormholes Given a target location, find relevant users Weigh Euclidean distance with normal distribution

03/12/ Maarten Clements Wormholes Given a target location, find relevant users Weigh Euclidean distance with normal distribution Aggregate data over all users, using computed weights º 2000x4000 histogram, example 4x8: User 1:User 2:User 1+2:

03/12/ Maarten Clements Convolution: Wormholes Given a target location, find relevant users Weigh Euclidean distance with normal distribution Aggregate data over all users, using computed weights Compute convolution with Gaussian kernel Compute difference with expected geotag distribution

03/12/ Maarten Clements Wormholes Result

03/12/ Maarten Clements Wormholes Sigma determines how many users we call Relevant σ σ Many relevant usersFew relevant users

03/12/ Maarten Clements Evaluation Find ground truth data: Wikipedia, GeoNames

03/12/ Maarten Clements Evaluation Rank predicted peaks and compute precision Is there a mountain in a range of 3cells around the predicted peak?  Average Precision σ (km) So… Does it work?

03/12/ Maarten Clements Evaluation (manual)

03/12/ Maarten Clements Evaluation (manual) σ = 100km

03/12/ Maarten Clements Evaluation (manual) σ = 20m Target: Tour Eiffel

03/12/ Maarten Clements Evaluation (manual) σ = 20m Target: Tour Eiffel

03/12/ Maarten Clements Evaluation (manual) σ = 80m Target: Tour Eiffel

03/12/ Maarten Clements Evaluation (manual) σ = 80m Target: Tour Eiffel

03/12/ Maarten Clements Evaluation (manual) Target: Tour Eiffel σ = 300m

03/12/ Maarten Clements Evaluation (manual) Target: Tour Eiffel σ = 300m

03/12/ Maarten Clements Evaluation (manual) σ = 60m Target: Pere Lachaise

03/12/ Maarten Clements Evaluation (manual) σ = 60m Target: Pere Lachaise

03/12/ Maarten Clements What next? User  Location Query exists of multiple points (instead of 1) Get rid of grid based prediction º Compute kernel convolution peaks directly from continuous geotag data.

03/12/ Maarten Clements What next?

03/12/ Maarten Clements What next?

03/12/ Maarten Clements Conclusions We have proposed a new method to predict similar locations based on geotags. Scale parameter can be used to predict relevant locations at different scales. ECIR’10: Comparing different user aggregation methods

03/12/ Maarten Clements