Elizeu Santos-Neto, Tatiana Pontes, Jussara Almeida, Matei Ripeanu

Slides:



Advertisements
Similar presentations
Suleyman Cetintas 1, Monica Rogati 2, Luo Si 1, Yi Fang 1 Identifying Similar People in Professional Social Networks with Discriminative Probabilistic.
Advertisements

Search in Source Code Based on Identifying Popular Fragments Eduard Kuric and Mária Bieliková Faculty of Informatics and Information.
1 Evaluation Rong Jin. 2 Evaluation  Evaluation is key to building effective and efficient search engines usually carried out in controlled experiments.
Learning more about Facebook and Twitter. Introduction  What we’ve covered in the Social Media webinar series so far  Agenda for this call Facebook.
Elizeu Santos-Neto, Flavio Figueiredo Jussara Almeida, Miranda Mowbray Marcos Gonçalves, Matei Ripeanu The 2 nd IEEE SocialCom/SIN -- August 2010.
Search Engines and Information Retrieval
Learning to Advertise. Introduction Advertising on the Internet = $$$ –Especially search advertising and web page advertising Problem: –Selecting ads.
Tracking User Attention in Collaborative Tagging Communities Elizeu Santos-Neto Matei Ripeanu Univesity of British Columbia Adriana Iamnitchi University.
Towards Boosting Video Popularity via Tag Selection Elizeu Santos-Neto, Tatiana Pontes, Jussara Almeida, Matei Ripeanu University of British Columbia -
Search Engines and Information Retrieval Chapter 1.
Tag Clouds Revisited Date : 2011/12/12 Source : CIKM’11 Speaker : I- Chih Chiu Advisor : Dr. Koh. Jia-ling 1.
What are these all types of?. What is the purpose of advertising? To attract new buyers and try to expand customer base To compete in the market To create.
APPLYING EPSILON-DIFFERENTIAL PRIVATE QUERY LOG RELEASING SCHEME TO DOCUMENT RETRIEVAL Sicong Zhang, Hui Yang, Lisa Singh Georgetown University August.
A Geographical Characterization of YouTube: a Latin American View Fernando Duarte, Fabrício Benevenuto, Virgílio Almeida, Jussara Almeida Federal University.
DETECTING SPAMMERS AND CONTENT PROMOTERS IN ONLINE VIDEO SOCIAL NETWORKS Fabrício Benevenuto ∗, Tiago Rodrigues, Virgílio Almeida, Jussara Almeida, and.
How YouTube Can Grow Your Business. YouTube is not only the second largest search engine in the world, but it also has over a billion unique, worldwide.
No Title, yet Hyunwoo Kim SNU IDB Lab. September 11, 2008.
Web Search. Structure of the Web n The Web is a complex network (graph) of nodes & links that has the appearance of a self-organizing structure  The.
ON THE SELECTION OF TAGS FOR TAG CLOUDS (WSDM11) Advisor: Dr. Koh. Jia-Ling Speaker: Chiang, Guang-ting Date:2011/06/20 1.
Summarization of XML Documents K Sarath Kumar. Outline I.Motivation II.System for XML Summarization III.Ranking Model and Summary Generation IV.Example.
Contextual Ranking of Keywords Using Click Data Utku Irmak, Vadim von Brzeski, Reiner Kraft Yahoo! Inc ICDE 09’ Datamining session Summarized.
How Useful are Your Comments? Analyzing and Predicting YouTube Comments and Comment Ratings Stefan Siersdorfer, Sergiu Chelaru, Wolfgang Nejdl, Jose San.
The Tube Over Time: Characterizing Popularity Growth of YouTube Videos ` Abstract In this work, we characterize the growth patterns of video popularity.
UKOLN is supported by: Introduction To Blogs And Social Networks For Heritage Organisations: Sharing Best Practices Brian Kelly UKOLN University of Bath.
Improved Video Categorization from Text Metadata and User Comments ACM SIGIR 2011:Research and development in Information Retrieval - Katja Filippova -
1 Adaptive Subjective Triggers for Opinionated Document Retrieval (WSDM 09’) Kazuhiro Seki, Kuniaki Uehara Date: 11/02/09 Speaker: Hsu, Yu-Wen Advisor:
INFOMGP Student names and numbers Papers’ references Title.
Fabricio Benevenuto, Gabriel Magno, Tiago Rodrigues, and Virgilio Almeida Universidade Federal de Minas Gerais Belo Horizonte, Brazil ACSAC 2010 Fabricio.
Meta-Path-Based Ranking with Pseudo Relevance Feedback on Heterogeneous Graph for Citation Recommendation By: Xiaozhong Liu, Yingying Yu, Chun Guo, Yizhou.
Presented by: Siddhant Kulkarni Spring Authors: Publication:  ICDE 2015 Type:  Research Paper 2.
Grow Your Business with Social Marketing
Digital Marketing Strategy by BCP Design. Digital Marketing Strategies In a digital age, where most users have computers and mobile devices, accessing.
The following Pitch Deck aims to help you sell Facebook Paid Ads to your clients. REMINDER: Get statistics and talking points on Facebook Paid Ads to.
Search Engine Optimization
Information Retrieval in Practice
Information Retrieval in Practice
Search Engine Optimization (SEO)
Recommendation in Scholarly Big Data
Content Reuse and Interest Sharing in Tagging Communities
SEARCH ENGINE OPTIMIZATION
Designing Cross-Language Information Retrieval System using various Techniques of Query Expansion and Indexing for Improved Performance  Hello everyone,
Preface to the special issue on context-aware recommender systems
Parts of an Academic Paper
Personalized Social Image Recommendation
SEARCH ENGINE OPTIMIZATION
SEARCH ENGINE OPTIMIZATION SEO. What is SEO? It is the process of optimizing structure, design and content of your website in order to increase traffic.
What is Digital Marketing? What is the use of Digital Marketing? Strategies of Digital Marketing Opportunities Search Engine Optimization.
Key points Content :- What is Digital Marketing? What is the use of Digital Marketing? Strategies of Digital Marketing Opportunities Search Engine Optimization.
Overview The promotion of products or brands via Digital media Digital Media  Search Engine Marketing Search Engine Marketing  Social Media Marketing.
The Language Schools Guide to Digital Marketing for International Recruitment Philippe Taza CEO Higher Education Marketing
Pilot project training
Chapter 4 Online Consumer Behavior, Market Research, and Advertisement
Team member 1, school Team member 2, school Team member 3, school
End of Year Performance Review Meetings and objective setting for 2018/19 This briefing pack is designed to be used by line managers to brief their teams.
Mathematics Lesson 1: Handling Data – Bar Charts
Time-Travel Using Spatially-Variant Temporal Metamaterials
Postdoc, School of Information, University of Arizona
Weakly Learning to Match Experts in Online Community
Improve Phase Wrap Up and Action Items
KDD Reviews 周天烁 2018年5月9日.
Project Title Team Members EE/CSCI 451: Project Presentation
Disambiguation Algorithm for People Search on the Web
Why listen to me? Sr. Digital Marketing Specialist for Fastline Media Group Social media is my world Fastline has seen a… 1,044% growth in Facebook audience.
What the Editors want to see!
Guidelines Dear author(s),
Presentation and project
Presentation and project
Presented By:- Abhinav Shashtri. Index SR.NOTitleSlide No 1Introduction: Build Awareness: Buildup Brand Image: Content Improves Website.
Research Portfolio Name: …
Social Media Interview
Presentation transcript:

Elizeu Santos-Neto, Tatiana Pontes, Jussara Almeida, Matei Ripeanu On the Choice of Data Sources to Improve Content Discoverability via Textual Feature Optimization Elizeu Santos-Neto, Tatiana Pontes, Jussara Almeida, Matei Ripeanu University of British Columbia - Vancouver, Canada Universidade Federal de Minas Gerais - Belo Horizonte, Brazil {elizeus,matei}@ece.ubc.ca {tpontes,jussara}@dcc.ufmg.br Santiago, Chile - September 4th, 2014

Introduction 100 hours of video per minute ads ads specialized content management companies are responsible for: publishing monitoring promoting the owner's content revenue shared between the manager and the video's owner As revenue is directly related to the number of ad prints, this motivates managers to boost content popularity. Start showing that: 1. the amount of video produced today is enormous 2. there are companies responsible to manage these content in order to make money through ads associated with the videos 3. since revenue is related to video popularity, this is a *relevant concern*

Introduction Title Viewers reach content items via different leads: email Title video repository keyword/tag-based search URL website promotion campaign Description Tag Tag Tag Tag comment Here, you need to emphasize that a large portion of viewers reach the content via keyword/tag-based search.

Introduction Title Description Tag comment Textual features have a major impact on the view count of a video, and consequently on the advertisement-generated revenues. This work focuses on the automated tag selection to boost video popularity. Here, we show how textual features are related to view count and, consequently, to the ad-generated revenues.

Context Assumption: annotating a video with the terms users would use to search for it increases the chance that users view the video. Textual sources related to the video and whose content can be automatically retrieved can be used as inputs for recommenders to suggest tags. - Recommendation Pipeline -

Context In this slide you should present and explain each data source, then you mention the filters used reaching the 'candidate keywords' box. The next slides will continue the pipeline. Experts Peers

Our Goal Q1. Is there room for improvement in the current YouTube video tags? [1] Santos-Neto, E., Pontes, T., Almeida, J., and Ripeanu, M. Towards Boosting Video Popularity via Tag Selection. Workshop on Social Multimedia and Storytelling, (2014) Q2. How do peer- and expert-produced input data sources compare regarding the quality of the recommended tags? Q3. Is the quality of the recommendation affected by the number of contributing peers? Here, we point out our *general focus* and research questions. You should explain that the 1st research question was answered in our last paper and motivated our current study.

Building the Ground Truth Amazon Mechanical Turk (AMT) Our task - Watch a movie trailer video and answer the following question: What query terms would you use to search for this video? For each video, associate a minimum of 3 and a maximum of 10 keywords. Talk about the ideal ground truth and the challenges to build it. Then, present our alternative solution (remember to talk about the pilot survey, if you think it is necessary).

Quality Control - each evaluation was inspected before approval. AMT Task Properties Property Value Number of Videos 382 Number of Turkers 33 Number of Evaluations 1,146 Payment per Task $0.30 Payment per Hour $3.00 Total Cost $345.00 I think it's interesting to mention that we take about 24 hours to get all the evaluations we need with the amount of money we made available in AMT. Also, emphasize the *quality control* as a strategy to avoid spam. You can point out that we rejected just one evaluation and say why. Quality Control - each evaluation was inspected before approval.

Experimental Evaluation By improvement we mean extending/modifying the tags of YouTube videos to better match the ground truth. - Success Metric: F3-measure Performance of the original YouTube tagset Performance of the recommended tagset produced by different data sources X I’m not sure if this slide is good. Here, we need to say what we want to evaluate/compare.

Q1. Is there room for improvement in the current YouTube video tags? Experimental Results Q1. Is there room for improvement in the current YouTube video tags?

Q1. Is there room for improvement in the current YouTube video tags? Experimental Results Q1. Is there room for improvement in the current YouTube video tags? Rotten Tomatoes - best source of candidate keywords followed by Movie Lens. Rotten Tomatoes incorporates a schema for the information provided (named entities). The difference in recommender’s performance for each pair of data source is statistically significant.

Experimental Results Q2. How do peer- and expert-produced input data sources compare regarding the quality of the recommended tags?

currently assigned to YouTube videos Experimental Results Q2. How do peer- and expert-produced input data sources compare regarding the quality of the recommended tags? Peer-produced data source is significantly better than using the expert-produced data. For frequency, the performance of peer-produced source is comparable to that of Rotten Tomatoes. Peer-produced source provides improvement relative to the tags currently assigned to YouTube videos (statistically significant for frequency).

Experimental Results Q3. Is the quality of the recommendation affected by the number of contributing peers? How many peers is an expert worth? Spearman’s rank correlation between the number of users and F3-measure for Movie Lens: 0.31 (mild positive correlation). The number of contributors partially explains the value added by Movie Lens to the recommenders’ performance. One potential reason is that the choices of keywords to tag a movie or search for it may be driven by different purposes. E.g.: A user may use the keyword 'boring' to tag a movie as an indication about his opinion about the video. However, this tag may not be considered a good choice for another user who want to search for the same video.

Conclusions Rotten Tomatoes has a schema of named entities that makes the keywords obtained from it significantly more promising than the ones already assigned to YouTube videos; Peer-produced data sources provide better candidate keywords for our automated tag recommendation than expert-produced sources; The number of contributing peers partially explains why peer-produced data sources outperform expert ones.

Elizeu Santos-Neto, Tatiana Pontes, Jussara Almeida, Matei Ripeanu On the Choice of Data Sources to Improve Content Discoverability via Textual Feature Optimization Elizeu Santos-Neto, Tatiana Pontes, Jussara Almeida, Matei Ripeanu University of British Columbia - Vancouver, Canada Universidade Federal de Minas Gerais - Belo Horizonte, Brazil {elizeus,matei}@ece.ubc.ca {tpontes,jussara}@dcc.ufmg.br Santiago, Chile - September 4th, 2014

Thank you!

Context Frequency - based on tag frequency patterns. Score candidate keywords based on how often they appear in the data provided by a data source. Random Walk - based on tag co-occurrence patterns. Builds a graph where each keyword is a node and an edge connects two keywords if they co-occur. The final node scores are used to rank the candidate keywords. * Our goal is not to design a novel and more efficient tag recommendation algorithm. Highlight that the algorithms are here used as a backdrop for our original problem. We do NOT aim to propose a new and revolutionary recommendation algorithm.

Context Analyse how automated tag selection optimize the tags associated to YouTube videos to attract search traffic. Evaluate how the choice of the data source used as input for a recommendation pipeline impacts the quality of the recommended tagset. This slide summarizes our general goal with the recommendation pipeline. So, after explaining the main components, you should summarize *our goal* to make the audience aware about what specifically we want to do. The next slide contains our research questions.