Download presentation
Presentation is loading. Please wait.
Published byIrene Bennett Modified over 9 years ago
1
Analysis and Forecasting of Trending Topics in Online Media Streams 1 ACM MM 2013 Tim Althoff, Damian Borth, Jörn Hees, Andreas Dengel German Research Center for Artificial Intelligence (DFKI) ALL RIGHTS RESERVED. No part of this work may be reproduced in any form or by any means, electronic or mechanical, including photocopying, recording, or by any information storage and retrieval system without expressed written permission from the authors.
2
Analysis and Forecasting of Trending Topics in Online Media Streams 2 Motivation
3
Analysis and Forecasting of Trending Topics in Online Media Streams 3 Large-scale Record of Human Behavior Primary method for communication by college students in the U.S. 340 million tweets each day by 500 million users One billion search requests and about twenty petabytes of user-generated data each day (2009)
4
Analysis and Forecasting of Trending Topics in Online Media Streams 4 Motivation People use the web for news, information, and communication Online activity becomes indicative of the interests of the global population › „Mirror of Society“ [Steven Van Belleghem, 2012]
5
Analysis and Forecasting of Trending Topics in Online Media Streams 5 Use Cases of Online Activity
6
Analysis and Forecasting of Trending Topics in Online Media Streams 6 Tracking Flu Infections flu remedy [Ginsberg et al. Detecting influenza epidemics using search engine query data. Nature. 2008.]
7
Analysis and Forecasting of Trending Topics in Online Media Streams 7 Relevance for the MM Community Panel: Cross-Media Analysis and Mining Focus more on the time dimension How do topics of interest develop over time?
8
Analysis and Forecasting of Trending Topics in Online Media Streams 8 Relevance for the MM Community Panel: Cross-Media Analysis and Mining Focus more on the time dimension How do topics of interest develop over time? Trend aware recommendation systems Automatic vocabulary selection for classification
9
Analysis and Forecasting of Trending Topics in Online Media Streams 9 Shortcomings of Related Work 1.Focus on very specific use cases (e.g. flu) 2.Nowcasting instead of forecasting 3.Manual correlation analysis instead of fully automatic forecasting 4.No comparison/study of different online & social media channels wrt. emerging trends [Choi and Varian. Predicting initial claims for unemployment benefits. Google, Inc. 2009.] [Choi and Varian. Predicting the Present with Google Trends. Economic Record, 2012.] [Ginsberg et al. Detecting influenza epidemics using search engine query data. Nature. 2008.] [Cooper et al. Cancer internet search activity on a major search engine. Journal of medical Internet research, 2005.] [Goel et al. Predicting consumer behavior with web search. National Academy of Sciences, 2010.] [Jin et al. The wisdom of social multimedia: using Flickr for prediction and forecast. ACM MM 2010.] [Adar et al. Why we search: visualizing and predicting user behavior. WWW, 2007.]
10
Analysis and Forecasting of Trending Topics in Online Media Streams 10 Goals 1.Multi-channel analysis of trending topics 2.Fully automatic forecasting of trending topics
11
Analysis and Forecasting of Trending Topics in Online Media Streams 11 Goals 1.Multi-channel analysis of trending topics 2.Fully automatic forecasting of trending topics
12
Analysis and Forecasting of Trending Topics in Online Media Streams 12 Trending Topics
13
Analysis and Forecasting of Trending Topics in Online Media Streams 13 Trending Topics A trending topic is a subject matter of discussion agreed on by a group of people experiences a sudden spike in user interest or engagement
14
Analysis and Forecasting of Trending Topics in Online Media Streams 14 Daily crawling of 10 sources Capturing Communication needs (Twitter) Search patterns (Google) Information demand (Wikipedia) Trending Topics Discovery Trends Search Germany Search USA News Germany News USA English Wikipedia German Wikipedia Trends Germany Trends USA Daily Trends Google [Damian Borth, Adrian Ulges, Thomas Breuel. Dynamic Vocabularies for Web-based Concept Detection by Trend Discovery. ACM Multimedia, 2012]
15
Analysis and Forecasting of Trending Topics in Online Media Streams 15 Daily crawling of 10 sources Capturing Communication needs (Twitter) Search patterns (Google) Information demand (Wikipedia) Trending Topics Discovery Trends Search Germany Search USA News Germany News USA English Wikipedia German Wikipedia Trends Germany Trends USA Daily Trends Dataset Observation period – Sept. 2011 – Sept. 2012 40,000 items analyzed ~ 3,000 trending topics discovered Google [Damian Borth, Adrian Ulges, Thomas Breuel. Dynamic Vocabularies for Web-based Concept Detection by Trend Discovery. ACM Multimedia, 2012]
16
Analysis and Forecasting of Trending Topics in Online Media Streams 16 Trending Topics Discovery
17
Analysis and Forecasting of Trending Topics in Online Media Streams 17 Trending Topics Discovery
18
Analysis and Forecasting of Trending Topics in Online Media Streams 18 Goals 1.Multi-channel analysis of trending topics 2.Fully automatic forecasting of trending topics
19
Analysis and Forecasting of Trending Topics in Online Media Streams 19 Multi-Channel Analysis
20
Analysis and Forecasting of Trending Topics in Online Media Streams 20 Example Neil Armstrong died on August 25, 2012.
21
Analysis and Forecasting of Trending Topics in Online Media Streams 21 Lifetime Analysis
22
Analysis and Forecasting of Trending Topics in Online Media Streams 22 Topic Category Analysis
23
Analysis and Forecasting of Trending Topics in Online Media Streams 23 Topic Category Analysis
24
Analysis and Forecasting of Trending Topics in Online Media Streams 24 Topic Category Analysis
25
Analysis and Forecasting of Trending Topics in Online Media Streams 25 Topic Category Analysis
26
Analysis and Forecasting of Trending Topics in Online Media Streams 26 Forecasting
27
Analysis and Forecasting of Trending Topics in Online Media Streams 27 Forecasting time observed valuesforecast
28
Analysis and Forecasting of Trending Topics in Online Media Streams 28 Basic Forecasting Techniques Naive Linear Trend Average/Median ARMA Nearest Neighbor
29
Analysis and Forecasting of Trending Topics in Online Media Streams 29 Basic Forecasting Techniques Naive Linear Trend Average/Median ARMA Nearest Neighbor
30
Analysis and Forecasting of Trending Topics in Online Media Streams 30 Basic Forecasting Techniques Naive Linear Trend Average/Median ARMA Nearest Neighbor
31
Analysis and Forecasting of Trending Topics in Online Media Streams 31 Basic Forecasting Techniques Naive Linear Trend Average/Median ARMA Nearest Neighbor
32
Analysis and Forecasting of Trending Topics in Online Media Streams 32 Basic Forecasting Techniques Naive Linear Trend Average/Median ARMA Nearest Neighbor
33
Analysis and Forecasting of Trending Topics in Online Media Streams 33 Example Signal
34
Analysis and Forecasting of Trending Topics in Online Media Streams 34 Forecasting Can Be Very Challenging
35
Analysis and Forecasting of Trending Topics in Online Media Streams 35 Recurring Signal
36
Analysis and Forecasting of Trending Topics in Online Media Streams 36 Recurring Signal
37
Analysis and Forecasting of Trending Topics in Online Media Streams 37 What To Do In This Case?
38
Analysis and Forecasting of Trending Topics in Online Media Streams 38 What To Do In This Case? Semantics!
39
Analysis and Forecasting of Trending Topics in Online Media Streams 39 Use Semantically Similar Topics
40
Analysis and Forecasting of Trending Topics in Online Media Streams 40 Approach
41
Analysis and Forecasting of Trending Topics in Online Media Streams 41 Approach
42
Analysis and Forecasting of Trending Topics in Online Media Streams 42 Approach
43
Analysis and Forecasting of Trending Topics in Online Media Streams 43 Discovering Semantically Similar Topics Trending Topic: “Olympics 2012”
44
Analysis and Forecasting of Trending Topics in Online Media Streams 44 Nearest Neighbor Sequence Matching
45
Analysis and Forecasting of Trending Topics in Online Media Streams 45 Forecasting
46
Analysis and Forecasting of Trending Topics in Online Media Streams 46 Forecasting
47
Analysis and Forecasting of Trending Topics in Online Media Streams 47 Forecasting
48
Analysis and Forecasting of Trending Topics in Online Media Streams 48 Forecasting
49
Analysis and Forecasting of Trending Topics in Online Media Streams 49 Forecasting
50
Analysis and Forecasting of Trending Topics in Online Media Streams 50 Forecasting
51
Analysis and Forecasting of Trending Topics in Online Media Streams 51 Evaluation
52
Analysis and Forecasting of Trending Topics in Online Media Streams 52 Dataset Public dataset of Wikipedia view statistics Historical data from 2008 until today Large-scale: 5 million articles attracting 870 million views each day – 2.8 TB compressed → Semantic similarity between 5 million articles → 9 billion sequences to choose from Use 200 top trending topics for evaluation
53
Analysis and Forecasting of Trending Topics in Online Media Streams 53 Discovering Semantically Similar Topics
54
Analysis and Forecasting of Trending Topics in Online Media Streams 54 Example: “The Hunger Games”
55
Analysis and Forecasting of Trending Topics in Online Media Streams 55 Example: “The Hunger Games” Trending Topic Trigger 14 day horizon Time in days Wikipedia article views Different models
56
Analysis and Forecasting of Trending Topics in Online Media Streams 56 Example: “The Hunger Games”
57
Analysis and Forecasting of Trending Topics in Online Media Streams 57 Example: “The Hunger Games”
58
Analysis and Forecasting of Trending Topics in Online Media Streams 58 Example: “The Hunger Games”
59
Analysis and Forecasting of Trending Topics in Online Media Streams 59 Example: “The Hunger Games”
60
Analysis and Forecasting of Trending Topics in Online Media Streams 60 Example: “The Hunger Games”
61
Analysis and Forecasting of Trending Topics in Online Media Streams 61 Example: “The Hunger Games”
62
Analysis and Forecasting of Trending Topics in Online Media Streams 62 Example: “The Hunger Games”
63
Analysis and Forecasting of Trending Topics in Online Media Streams 63 Example: “The Hunger Games”
64
Analysis and Forecasting of Trending Topics in Online Media Streams 64 Example: “The Hunger Games”
65
Analysis and Forecasting of Trending Topics in Online Media Streams 65 Example: “The Hunger Games”
66
Analysis and Forecasting of Trending Topics in Online Media Streams 66 Example: “The Hunger Games”
67
Analysis and Forecasting of Trending Topics in Online Media Streams 67 Example: “The Hunger Games”
68
Analysis and Forecasting of Trending Topics in Online Media Streams 68 Example: “The Hunger Games”
69
Analysis and Forecasting of Trending Topics in Online Media Streams 69 Example: “The Hunger Games”
70
Analysis and Forecasting of Trending Topics in Online Media Streams 70 Quantitative Results
71
Analysis and Forecasting of Trending Topics in Online Media Streams 71 Root Mean Square Error
72
Analysis and Forecasting of Trending Topics in Online Media Streams 72 Root Mean Square Error
73
Analysis and Forecasting of Trending Topics in Online Media Streams 73 Root Mean Square Error
74
Analysis and Forecasting of Trending Topics in Online Media Streams 74 Mean Average Percentage Error
75
Analysis and Forecasting of Trending Topics in Online Media Streams 75 Mean Average Percentage Error
76
Analysis and Forecasting of Trending Topics in Online Media Streams 76 Mean Average Percentage Error
77
Analysis and Forecasting of Trending Topics in Online Media Streams 77 Conclusion
78
Analysis and Forecasting of Trending Topics in Online Media Streams 78 Main Contributions 1.Multi-channel analysis of trending topics 2.Fully automatic forecasting technique exploiting semantic relationships between topics 3.Empirical evaluation on a large-scale Wikipedia dataset
79
Analysis and Forecasting of Trending Topics in Online Media Streams 79 Insights 1.Trends on Twitter and Wikipedia are significantly more ephemeral than on Google. 2.All observed media channels tend to specialize in specific topic categories. 3.Semantically similar topics exhibit similar behavior. 4.Exploiting this observation can significantly improve forecasting performance (by 20-90% compared to baselines).
80
Analysis and Forecasting of Trending Topics in Online Media Streams 80 Acknowledgments
81
Analysis and Forecasting of Trending Topics in Online Media Streams 81 Questions? Thank you for your attention Tim Althoff, Damian Borth, Jörn Hees, Andreas Dengel Analysis and Forecasting of Trending Topics in Online Media Streams
82
82 Backup Slides
83
Analysis and Forecasting of Trending Topics in Online Media Streams 83 Motivation Big Data: “High volume, high velocity, and/or high variety information assets that require new forms of processing to enable enhanced decision making, insight discovery and process optimization” Large-scale user behavior › „Mirror of Society“ [Mark A. Beyer and Douglas Laney. The importance of ’big data’: A definition. June 2012.]
84
Analysis and Forecasting of Trending Topics in Online Media Streams 84 Large-scale Record of Human Behavior 1.2 million video minutes per second (est. 2016) Primary method for communication by college students in the U.S. 25 million articles and 1.8 billion page edits in 285 languages created by 100k contributors 340 million tweets each day by 500 million users One billion search requests and about twenty petabytes of user-generated data each day (2009)
85
Analysis and Forecasting of Trending Topics in Online Media Streams 85 Unemployment Claims jobs [Choi and Varian. Predicting initial claims for unemployment benefits. Google, Inc. 2009.]
86
Analysis and Forecasting of Trending Topics in Online Media Streams 86 What is the world talking about? Social Media can offer insights in what the world is talking about how people feel about events, brands and products resulting in an collective awareness for trending topics A trending topic is a subject matter of discussion agreed on by a group of people experiences a sudden spike in user interest or engagement [Damian Borth, Adrian Ulges, Thomas Breuel. Dynamic Vocabularies for Web-based Concept Detection by Trend Discovery. ACM Multimedia, 2012]
87
Analysis and Forecasting of Trending Topics in Online Media Streams 87 Trending Topics Discovery
88
Analysis and Forecasting of Trending Topics in Online Media Streams 88 Trending Topics Discovery
89
Analysis and Forecasting of Trending Topics in Online Media Streams 89 Lifetime Analysis
90
Analysis and Forecasting of Trending Topics in Online Media Streams 90 Lifetime Analysis
91
Analysis and Forecasting of Trending Topics in Online Media Streams 91 Lifetime Analysis
92
Analysis and Forecasting of Trending Topics in Online Media Streams 92 Lifetime Analysis
93
Analysis and Forecasting of Trending Topics in Online Media Streams 93 Example 2: “Battlefield 3”
94
Analysis and Forecasting of Trending Topics in Online Media Streams 94 Example 2: “Battlefield 3” Trending Topic Trigger 14 day horizon Time in days Wikipedia article views
95
Analysis and Forecasting of Trending Topics in Online Media Streams 95 Example 2: “Battlefield 3”
96
Analysis and Forecasting of Trending Topics in Online Media Streams 96 Example 2: “Battlefield 3”
97
Analysis and Forecasting of Trending Topics in Online Media Streams 97 Example 2: “Battlefield 3”
98
Analysis and Forecasting of Trending Topics in Online Media Streams 98 Example 2: “Battlefield 3”
99
Analysis and Forecasting of Trending Topics in Online Media Streams 99 Example 2: “Battlefield 3”
100
Analysis and Forecasting of Trending Topics in Online Media Streams 100 Example 2: “Battlefield 3”
101
Analysis and Forecasting of Trending Topics in Online Media Streams 101 Example 2: “Battlefield 3”
102
Analysis and Forecasting of Trending Topics in Online Media Streams 102 Example 2: “Battlefield 3”
103
Analysis and Forecasting of Trending Topics in Online Media Streams 103 Example 2: “Battlefield 3”
104
Analysis and Forecasting of Trending Topics in Online Media Streams 104 Example 2: “Battlefield 3”
105
Analysis and Forecasting of Trending Topics in Online Media Streams 105 Example 2: “Battlefield 3”
106
Analysis and Forecasting of Trending Topics in Online Media Streams 106 Example 2: “Battlefield 3”
107
Analysis and Forecasting of Trending Topics in Online Media Streams 107 Example 2: “Battlefield 3”
108
Analysis and Forecasting of Trending Topics in Online Media Streams 108 Example 2: “Battlefield 3”
Similar presentations
© 2024 SlidePlayer.com. Inc.
All rights reserved.