#VisualHashtags Visual Summarization of Social Media Events using Mid-Level Visual Elements Sonal Goel (IIIT-Delhi), Sarthak Ahuja (IBM Research, India),

Slides:



Advertisements
Similar presentations
Presentation at Society of The Query conference, Amsterdam November 13-14, 2009 (original title: Learning from Google: software design as a methodology.
Advertisements

Less is More Probabilistic Model for Retrieving Fewer Relevant Docuemtns Harr Chen and David R. Karger MIT CSAIL SIGIR2006 4/30/2007.
Query Chain Focused Summarization Tal Baumel, Rafi Cohen, Michael Elhadad Jan 2014.
1 Evaluation Rong Jin. 2 Evaluation  Evaluation is key to building effective and efficient search engines usually carried out in controlled experiments.
Foreground Focus: Finding Meaningful Features in Unlabeled Images Yong Jae Lee and Kristen Grauman University of Texas at Austin.
WWW 2014 Seoul, April 8 th SNOW 2014 Data Challenge Two-level message clustering for topic detection in Twitter Georgios Petkos, Symeon Papadopoulos, Yiannis.
Towards Twitter Context Summarization with User Influence Models Yi Chang et al. WSDM 2013 Hyewon Lim 21 June 2013.
Pete Bohman Adam Kunk.  Introduction  Related Work  System Overview  Indexing Scheme  Ranking  Evaluation  Conclusion.
Comparing Twitter Summarization Algorithms for Multiple Post Summaries David Inouye and Jugal K. Kalita SocialCom May 10 Hyewon Lim.
Patch to the Future: Unsupervised Visual Prediction
Active Learning and Collaborative Filtering
Evaluating Search Engine
1 Learning to Detect Objects in Images via a Sparse, Part-Based Representation S. Agarwal, A. Awan and D. Roth IEEE Transactions on Pattern Analysis and.
Large-Scale Deduplication with Constraints using Dedupalog Arvind Arasu et al.
Video summarization by graph optimization Lu Shi Oct. 7, 2003.
Supporting the Automatic Construction of Entity Aware Search Engines Lorenzo Blanco, Valter Crescenzi, Paolo Merialdo, Paolo Papotti Dipartimento di Informatica.
Beyond Co-occurrence: Discovering and Visualizing Tag Relationships from Geo-spatial and Temporal Similarities Date : 2012/8/6 Resource : WSDM’12 Advisor.
UOS 1 Ontology Based Personalized Search Zhang Tao The University of Seoul.
Bug Localization with Machine Learning Techniques Wujie Zheng
A Machine Learning Approach to Sentence Ordering for Multidocument Summarization and Its Evaluation D. Bollegala, N. Okazaki and M. Ishizuka The University.
A Graph-based Friend Recommendation System Using Genetic Algorithm
BING: Binarized Normed Gradients for Objectness Estimation at 300fps
Binxing Jiao et. al (SIGIR ’10) Presenter : Lin, Yi-Jhen Advisor: Dr. Koh. Jia-ling Date: 2011/4/25 VISUAL SUMMARIZATION OF WEB PAGES.
C. Lawrence Zitnick Microsoft Research, Redmond Devi Parikh Virginia Tech Bringing Semantics Into Focus Using Visual.
Automatic Video Tagging using Content Redundancy Stefan Siersdorfer 1, Jose San Pedro 2, Mark Sanderson 2 1 L3S Research Center, Germany 2 University of.
From Text to Image: Generating Visual Query for Image Retrieval Wen-Cheng Lin, Yih-Chen Chang and Hsin-Hsi Chen Department of Computer Science and Information.
Multi-level Bootstrapping for Extracting Parallel Sentence from a Quasi-Comparable Corpus Pascale Fung and Percy Cheung Human Language Technology Center,
Finding document topics for improving topic segmentation Source: ACL2007 Authors: Olivier Ferret (18 route du Panorama, BP6) Reporter:Yong-Xiang Chen.
1 Friends and Neighbors on the Web Presentation for Web Information Retrieval Bruno Lepri.
KAIST TS & IS Lab. CS710 Know your Neighbors: Web Spam Detection using the Web Topology SIGIR 2007, Carlos Castillo et al., Yahoo! 이 승 민.
Text Information Management ChengXiang Zhai, Tao Tao, Xuehua Shen, Hui Fang, Azadeh Shakery, Jing Jiang.
1 Discovering Web Communities in the Blogspace Ying Zhou, Joseph Davis (HICSS 2007)
哈工大信息检索研究室 HITIR ’ s Update Summary at TAC2008 Extractive Content Selection Using Evolutionary Manifold-ranking and Spectral Clustering Reporter: Ph.d.
Alvin CHAN Kay CHEUNG Alex YING Relationship between Twitter Events and Real-life.
WEB STRUCTURE MINING SUBMITTED BY: BLESSY JOHN R7A ROLL NO:18.
Visualization in Process Mining
Generating data with enacted methods
Cohesive Subgraph Computation over Large Graphs
Queensland University of Technology
AP CSP: Cleaning Data & Creating Summary Tables
Topical Authority Detection and Sentiment Analysis on Top Influencers
A Visualization Tool for fMRI Data Mining
Evaluation of IR Systems
An Empirical Study of Learning to Rank for Entity Search
On Assigning Implicit Reputation Scores in an Online Labor Marketplace
Patterns extraction from process executions
Summary Presented by : Aishwarya Deep Shukla
Video Google: Text Retrieval Approach to Object Matching in Videos
Datamining : Refers to extracting or mining knowledge from large amounts of data Applications : Market Analysis Fraud Detection Customer Retention Production.
On the Scale and Performance of Cooperative Web Proxy Caching
Summarizing Entities: A Survey Report
Dieudo Mulamba November 2017
Data Mining Cluster Analysis: Advanced Concepts and Algorithms
Zan Gao, Deyu Wang, Xiangnan He, Hua Zhang
Applying Key Phrase Extraction to aid Invalidity Search
Discovering Functional Communities in Social Media
Vessel Extraction in X-Ray Angiograms Using Deep Learning
A Level Art & Design – Summer Project
CS 594: Empirical Methods in HCC Social Network Analysis in HCI
MIS2502: Data Analytics Clustering and Segmentation
Pooria Taghizadeh : Dr. Hadi Tabatabaee : Dr. Mona Ghassemian :
MIS2502: Data Analytics Clustering and Segmentation
Graph-based Security and Privacy Analytics via Collective Classification with Joint Weight Learning and Propagation Binghui Wang, Jinyuan Jia, and Neil.
Housam Babiker, Randy Goebel and Irene Cheng
Video Google: Text Retrieval Approach to Object Matching in Videos
Presented By: Grant Glass
Introduction Dataset search
Connecting the Dots Between News Article
Yingze Wang and Shi-Kuo Chang University of Pittsburgh
Social Media Interview
Presentation transcript:

#VisualHashtags Visual Summarization of Social Media Events using Mid-Level Visual Elements Sonal Goel (IIIT-Delhi), Sarthak Ahuja (IBM Research, India), A.V. Subramanyam (IIIT-Delhi), Ponnurangam Kumaraguru (IIIT-Delhi) ACM Multimedia 2017

Introduction The data generated on social media sites grows at an increasing rate with more than 36% of tweets containing images We aim to discover #VisualHashtags, i.e., meaningful patches that can become the visual analog of a regular text hashtag that Twitter generates These the entities which are both representative and discriminative to the target dataset besides being human-readable and hence more informative Our core novelty – a novel pipeline to summarize images from social media events instead of the conventional method of only identifying key-images to represent the event. Our approach includes a multi-stage filtering process which when coupled with the basic methodology to discover mid-level visual elements leads to an improvement in coverage over existing methods.

Prior Art Work on Social Media Image Dataset Summarization Schinas et al. use both tweets and images to summarize an event. My reveal topics from a set of tweets as highly connected messages in a graph, whose nodes encode messages and whose edges encode their similarities. Finally, the images that best represent the topic are selected based on their relevance and diversity. Cavalin et al. propose a social media imagery analytics system that processes and organize the images in more manageable way by removing duplicate, near-duplicate images and clustering images having similar content Work on Image Dataset Summarization at Patch Level Doersch et al. collect data from Google Street View of different cities, and aim to automatically and the visual patches like windows, balconies, and street signs, that are most distinctive for a certain geo-spatial area. Rematas et al. propose data-mining approach for exploring image collections by interesting patterns that use discriminative patches and further show the results on Pascal VOC and Microsoft COCO datasets.

Methodology 1 2

Algorithm scorei is the score of the detector n is the number of nearest neighbors, c is the class of the nearest neighbor (1 for negative, 0 for positive), Sj is the frequency (number of duplicates) of the image to which the patch belongs.

Dataset The data generated on social media sites grows at an increasing rate with more than 36% of tweets containing images We aim to discover #VisualHashtags, i.e., meaningful patches that can become the visual analog of a regular text hashtag that Twitter generates These the entities which are both representative and discriminative to the target dataset besides being human-readable and hence more informative

Results – 1 (Sports)

Results – 2 (Politics)

Results – 3 (Temporal Analysis)

Results – 3 (Pattern Mining)

Evaluation - Quantitative

Evaluation - Qualitative User Study – 21 People Given the set of patches below, choose the most appropriate event which it summarizes. Select all the patches that can be distinctly linked with the event chosen above. Select all the patches that are Meaningful ,i.e. covering a meaningful part of an image How many of the below rows demonstrate strong correlation (containing similar elements like faces/buildings, etc) among their elements? Metrics (1) Precision (Pr@N): percentage of patches among the top N that are relevant/meaningful to the corresponding event, averaged among all events. (2) Success (S@N@D): percentage of responses, where there exist at least D relevant/meaningful patches amongst the top N. (3) Mean Reciprocal Rank (MRR): Computed as 1/r, where r is the rank of the first relevant/meaningful patch returned, averaged over all events.

Future Work Currently, our approach is centered around events that contain images with relative stylistic coherence and uniqueness, and thus #VisualHashtags generated also focus on concrete entities. As future work, the technique can be modified to summarize more abstract phenomenon like violence, summer, etc. Mid-level patches obtained as a summary of a particular viral event, can be further generalized to pave way for finding higher-level image features that can cover the essence of an event. While the current approach needs to be re-run to generate #VisualHashtags at different time instances, dynamic re-summarization would be an interesting direction to explore, making it a more real-time system.