Presentation is loading. Please wait.

Presentation is loading. Please wait.

Large Graph Mining: Power Tools and a Practitioner’s guide

Similar presentations


Presentation on theme: "Large Graph Mining: Power Tools and a Practitioner’s guide"— Presentation transcript:

1 Large Graph Mining: Power Tools and a Practitioner’s guide
Task 6: Virus/Influence Propagation Faloutsos, Miller,Tsourakakis CMU KDD'09 Faloutsos, Miller, Tsourakakis

2 Faloutsos, Miller, Tsourakakis
Outline Introduction – Motivation Task 1: Node importance Task 2: Community detection Task 3: Recommendations Task 4: Connection sub-graphs Task 5: Mining graphs over time Task 6: Virus/influence propagation Task 7: Spectral graph theory Task 8: Tera/peta graph mining: hadoop Observations – patterns of real graphs Conclusions KDD'09 Faloutsos, Miller, Tsourakakis

3 Faloutsos, Miller, Tsourakakis
Detailed outline Epidemic threshold Problem definition Analysis Experiments Fraud detection in e-bay KDD'09 Faloutsos, Miller, Tsourakakis

4 Faloutsos, Miller, Tsourakakis
Virus propagation How do viruses/rumors propagate? Blog influence? Will a flu-like virus linger, or will it become extinct soon? KDD'09 Faloutsos, Miller, Tsourakakis

5 Faloutsos, Miller, Tsourakakis
The model: SIS ‘Flu’ like: Susceptible-Infected-Susceptible Virus ‘strength’ s= b/d Healthy Prob. d N2 Prob. b Prob. β N1 N Infected N3 KDD'09 Faloutsos, Miller, Tsourakakis

6 Epidemic threshold t of a graph: the value of t, such that
if strength s = b / d < t an epidemic can not happen Thus, given a graph compute its epidemic threshold KDD'09 Faloutsos, Miller, Tsourakakis

7 Faloutsos, Miller, Tsourakakis
Epidemic threshold t What should t depend on? avg. degree? and/or highest degree? and/or variance of degree? and/or third moment of degree? and/or diameter? KDD'09 Faloutsos, Miller, Tsourakakis

8 Faloutsos, Miller, Tsourakakis
Epidemic threshold [Theorem 1] We have no epidemic, if β/δ <τ = 1/ λ1,A KDD'09 Faloutsos, Miller, Tsourakakis

9 Faloutsos, Miller, Tsourakakis
Epidemic threshold [Theorem 1] We have no epidemic (*), if epidemic threshold recovery prob. β/δ <τ = 1/ λ1,A largest eigenvalue of adj. matrix A attack prob. Proof: [Wang+03] (*) under mild, conditional-independence assumptions KDD'09 Faloutsos, Miller, Tsourakakis

10 Faloutsos, Miller, Tsourakakis
details Beginning of proof t+1: - ( healthy or healed ) - and not t Let: p(i , t) = Prob node i is t+1 1 - p(i, t+1 ) = (1 – p(i, t) + p(i, t) * d ) * Pj (1 – b aji * p(j , t) ) Below threshold, if the above non-linear dynamical system above is ‘stable’ (eigenvalue of Hessian < 1 ) KDD'09 Faloutsos, Miller, Tsourakakis

11 Epidemic threshold for various networks
Formula includes older results as special cases: Homogeneous networks [Kephart+White] λ1,A = <k>; τ = 1/<k> (<k> : avg degree) Star networks (d = degree of center) λ1,A = sqrt(d); τ = 1/ sqrt(d) Infinite power-law networks λ1,A = ∞; τ = 0 ; [Barabasi] KDD'09 Faloutsos, Miller, Tsourakakis

12 Faloutsos, Miller, Tsourakakis
Epidemic threshold [Theorem 2] Below the epidemic threshold, the epidemic dies out exponentially KDD'09 Faloutsos, Miller, Tsourakakis

13 Faloutsos, Miller, Tsourakakis
Detailed outline Epidemic threshold Problem definition Analysis Experiments Fraud detection in e-bay KDD'09 Faloutsos, Miller, Tsourakakis

14 Current prediction vs. previous
Number of infected nodes PL-3 PL-3 Our Our β/δ β/δ Oregon Star The formula’s predictions are more accurate KDD'09 Faloutsos, Miller, Tsourakakis

15 Faloutsos, Miller, Tsourakakis
Experiments (Oregon) b/d > τ (above threshold) b/d = τ (at the threshold) b/d < τ (below threshold) KDD'09 Faloutsos, Miller, Tsourakakis

16 SIS simulation - # infected nodes vs time
above Log - Lin at #inf. (log scale) below Time (linear scale) KDD'09 Faloutsos, Miller, Tsourakakis

17 SIS simulation - # infected nodes vs time
above Log - Lin at #inf. (log scale) below Exponential decay Time (linear scale) KDD'09 Faloutsos, Miller, Tsourakakis

18 SIS simulation - # infected nodes vs time
above Log - Log at #inf. (log scale) below Time (log scale) KDD'09 Faloutsos, Miller, Tsourakakis

19 SIS simulation - # infected nodes vs time
above Log - Log at #inf. (log scale) below Power-law Decay (!) Time (log scale) KDD'09 Faloutsos, Miller, Tsourakakis

20 Faloutsos, Miller, Tsourakakis
extra Detailed outline Epidemic threshold Fraud detection in e-bay KDD'09 Faloutsos, Miller, Tsourakakis

21 Faloutsos, Miller, Tsourakakis
extra E-bay Fraud detection w/ Polo Chau & Shashank Pandit, CMU NetProbe: A Fast and Scalable System for Fraud Detection in Online Auction Networks, S. Pandit, D. H. Chau, S. Wang, and C. Faloutsos (WWW'07), pp KDD'09 Faloutsos, Miller, Tsourakakis

22 Faloutsos, Miller, Tsourakakis
extra E-bay Fraud detection lines: positive feedbacks would you buy from him/her? KDD'09 Faloutsos, Miller, Tsourakakis

23 Faloutsos, Miller, Tsourakakis
extra E-bay Fraud detection lines: positive feedbacks would you buy from him/her? or him/her? KDD'09 Faloutsos, Miller, Tsourakakis

24 E-bay Fraud detection - NetProbe
extra E-bay Fraud detection - NetProbe Belief Propagation gives: KDD'09 Faloutsos, Miller, Tsourakakis

25 Faloutsos, Miller, Tsourakakis
Conclusions λ1,A : Eigenvalue of adjacency matrix determines the survival of a flu-like virus It gives a measure of how well connected is the graph (~ # paths – see Task 7, later) May guide immunization policies [Belief Propagation: a powerful algo] KDD'09 Faloutsos, Miller, Tsourakakis

26 Faloutsos, Miller, Tsourakakis
References D. Chakrabarti, Y. Wang, C. Wang, J. Leskovec, and C. Faloutsos, Epidemic Thresholds in Real Networks, in ACM TISSEC, 10(4), 2008 Ganesh, A., Massoulie, L., and Towsley, D., The effect of network topology on the spread of epidemics. In INFOCOM. KDD'09 Faloutsos, Miller, Tsourakakis

27 Faloutsos, Miller, Tsourakakis
References (cont’d) Hethcote, H. W The mathematics of infectious diseases. SIAM Review 42, 599–653. Hethcote, H. W. AND Yorke, J. A Gonorrhea Transmission Dynamics and Control. Vol. 56. Springer. Lecture Notes in Biomathematics. KDD'09 Faloutsos, Miller, Tsourakakis

28 Faloutsos, Miller, Tsourakakis
References (cont’d) Y. Wang, D. Chakrabarti, C. Wang and C. Faloutsos, Epidemic Spreading in Real Networks: An Eigenvalue Viewpoint, in SRDS 2003 (pages 25-34), Florence, Italy KDD'09 Faloutsos, Miller, Tsourakakis


Download ppt "Large Graph Mining: Power Tools and a Practitioner’s guide"

Similar presentations


Ads by Google