Download presentation
Presentation is loading. Please wait.
Published byStephen Owens Modified over 8 years ago
1
1/54 Rumor Source Detection: A Group Testing Approach Ding-Zhu Du Department of Computer Science University of Texas at Dallas
2
2/54 OUTLINE I.Background II.Rumor Source Detection Problem III.Group Testing Approach
3
3/54 Social Networks > 1.3 billion users The 2 nd largest “Country” in the world More visitors than Google More than 6 billion images 2009, 2 billion tweets per quarter 2010, 4 billion tweets per quarter 2011, 25 billion tweets per quarter > 800 million users Pinterest, with a traffic higher than Twitter and Google 2013, 400 million users, 40% yearly increase
4
4/54 A Trillion Dollar Opportunity Online Social networks have become a bridge to connect our daily physical life and the virtual web space On2Off Commerce [1] [1] Online to Offline is trillion dollar business http://techcrunch.com/2010/08/07/why-online2offline-commerce-is-a-trillion-dollar-opportunity/
5
5/54 Influence Propagation I love Obama I hate Obama, the worst president ever Positive Negative
6
6/54 What is Social Influence? Social influence occurs when one's opinions, emotions, or behaviors are affected by others, intentionally or unintentionally. [1] [1] http://en.wikipedia.org/wiki/Social_influence
7
7/54 Three Degree of Influence Three degree of Influence [2] Six degree of separation [1] Each person may influence 150 persons [3] In total, you are able to influence about 1,000,000 (150 3 ) persons in the world! Each person may influence 150 persons [3] In total, you are able to influence about 1,000,000 (150 3 ) persons in the world! [1] S. Milgram. The Small World Problem. Psychology Today, 1967, Vol. 2, 60–67 [2] J.H. Fowler and N.A. Christakis. The Dynamic Spread of Happiness in a Large Social Network: Longitudinal Analysis Over 20 Years in the Framingham Heart Study. British Medical Journal 2008; 337: a2338 [3] R. Dunbar. Neocortex size as a constraint on group size in primates. Human Evolution, 1992, 20: 469–493.
8
8/54 6/13/2016 8 When misinformation or rumor spreads in social networks, what will happen?
9
9/54 A misinformation said that the president of Syria is dead, and it hit the twitter greatly and was circulated fast among the population, leading to a sharp, quick increase in the price of oil. http://news.yahoo.com/blogs/technology-blog/twitter-rumor- leads-sharp-increase-price-oil-173027289.html 6/13/2016 9
10
10/54 In August, 2012, thousands of people in Ghazni province left their houses in the middle of the night in panic after the rumor of earthquake. http://www.pajhwok.com/en/2012/08/20/quak e-rumour-sends-thousands-ghazni-streets 6/13/2016 10
11
11/54 6/13/2016 11
12
12/54 OUTLINE I.Background II.Rumor Source Detection Problem III.Group Testing Approach
13
13/54 Motivation Rumors spread through the network We only see who received rumor but not where they got rumor from Can we locate the hidden rumor sources?
14
14/54 Motivation Rumors spread through the network We only see who received rumor but not where they got rumor from Can we locate the hidden rumor sources?
15
15/54 Problem Description Given –Social network structure –Infection time of monitors Goal –Select a subset of vertices with minimum cardinality such that the rumor source can be uniquely located. Question –Which set of vertices should we select? Applications –Epidemiology: Virus –Social Media: Rumor
16
16/54 Related Work Shah and Zaman, 2010, 2011, 2012: –“Rumor Centrality”-single source, Susceptible- Infected (SI) model Luo and Tay, 2012: –Multiple sources, Susceptible-Infected-Recovered (SIR) model Zhu and Ying, 2013: –Single source estimation for SIR model Seo et al., 2012; Karamchandani and Franceschetti, 2013; Luo and Tay, 2013; Zhu and Ying, 2014: –Partial observations
17
17/54 OUTLINE I.Background II.Rumor Source Detection Problem III.Group Testing Approach
18
18/54 What is Group Testing? Part I 18
19
19/54 An Example There are 9 items with 1 defective. Please identify the defective item with the minimum number of tests. Each test on a subset of items can tell whether the subset contains the defective item or not. 19
20
20/54 20 3 tests are enough? 1 2 3 4 5 6 7 8 9 1 2 3 4 5 4 5
21
21/54 21 No, sometime you need 4 tests. 1 2 3 4 5 6 7 8 9 1 2 3 4 5 1 2 3 1 2
22
22/54 Minimum # of Tests Goal: minimize # of tests in the worst case. So, for bisecting on 9 items with 1 defective, the # of tests in the worst case is 4. For bisecting on n items with 1 defective, require tests 22
23
23/54 This is the best you can do! There are totally n outcomes. Each tests has two outcomes, which divides a subset of outcomes into two smaller subsets. Suppose k tests are enough. Then 23
24
24/54 Two Defective Items Need at least tests. For some n, the lower bound can be reached. But, for some n, the lower bound can not be reached. This case is complicated. 24
25
25/54 3 or more defective items Very hard to find the optimal algorithm for determine tests!!! 25
26
26/54 26 Classical Group Testing Given n items with some positive ones, identify all positive ones by less number of tests. Each test is on a subset of items. Test outcome is positive iff there is a positive item in the subset.
27
27/54 27 Idea of Group Testing (GT) ___ __ __ + positivenegative + ___ ___ ___
28
28/54 28 History During World War II, to test syphilitic antigen, David Rosenblatt first proposed the idea. In 1943, Robert Dorfman published the first paper on Group Testing. After 1943, there are many papers published and many applications discovered.
29
29/54 29
30
30/54 30 Group Testing Sequential Group Testing Nonadaptive Group Testing Pooling Design (Biology)
31
31/54 31 Example 1 - Sequential 1 2 3 4 5 6 7 8 9 1 2 3 4 5 4 5
32
32/54 32 Example 2 – Non-adaptive p1 1 2 3 p2 4 5 6 p3 7 8 9 p4 p5 p6 O ( ) tests for n items
33
33/54 33
34
34/54 34 Sequential and Non-adaptive Sequential GT needs less number of tests, but longer time. Non-adaptive GT needs more tests, but shorter time. In molecular biology, non-adaptive GT is usually taken. Why?
35
35/54 35 Los Alamos Labs in 1998 Face 220,000 clones to do screening. If test individually, need 220,000 tests. Actually, use 376 tests. What is the technique?
36
36/54 Methodology for Rumor Source Detection Definition (Set Resolving Set (SRS)). Node set K ⊆ V is an SRS if any different detectable node sets A,B ∈ V are distinguishable by K. Two node sets A,B ⊆ V are distinguishable by K if there exist two nodes x, y ∈ K such that – : the time that node x received the rumor from A,
37
37/54 Influence Propagation Model Rumor propagates from the sources to any vertex through shortest paths in the network. As soon as a vertex receives the information, it sends the information to all its neighbors simultaneously, which takes one time unit. Thus, the time that a rumor initiated at node u is received by node v is r u (v) = s(u) + d(u, v).
38
38/54 An Example of Set Resolving Set (SRS) A EF D B C { A,B,C } is a SRS. ABC r(A)-r(B)r(A)-r(C)r(B)-r(C) A011 0 B102 1 -2 C12012 D211110 E121 01 F21210
39
39/54 An Example of Set Resolving Set (SRS) A EF D B C { A,C } is not a SRS. AC r(A)-r(C) E110 F220
40
40/54
41
41/54 Active Source & Inactive Source Only active source is detectable.
42
42/54 Theorem For any graph, there exists a SRS
43
43/54 Problem Definition MULTI-RUMOR-SOURCE DETECTION problem (MRSD): find a SRS K with the smallest cardinality.
44
44/54 Greedy Algorithm & Its Approximation Ratio Theorem. Algorithm1 correctly computes a SRS with provable approximation ratio of at most (1 + r ln n + ln log 2 ). – r : upper bound for the number of sources – : maximum number of equivalence classes divided by one node-pair.
45
45/54 THANK YOU VERY MUCH!
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.