The Theory of Zeta Graphs with an Application to Random Networks Christopher Ré Stanford.

Slides:



Advertisements
Similar presentations
Lower Bounds for Additive Spanners, Emulators, and More David P. Woodruff MIT and Tsinghua University To appear in FOCS, 2006.
Advertisements

Routing in Poisson small-world networks A. J. Ganesh Microsoft Research, Cambridge Joint work with Moez Draief.
Complex Networks: Complex Networks: Structures and Dynamics Changsong Zhou AGNLD, Institute für Physik Universität Potsdam.
Complex Networks Advanced Computer Networks: Part1.
Network analysis Sushmita Roy BMI/CS 576
Purnamrita Sarkar (Carnegie Mellon) Deepayan Chakrabarti (Yahoo! Research) Andrew W. Moore (Google, Inc.)
Scale Free Networks.
Jennifer Tour Chayes Joint work with N. Berger, C. Borgs, A. Ganesh, A. Saberi, D. B. Wilson Controlling the Spread of Viruses on Power-Law Networks.
Algorithmic and Economic Aspects of Networks Nicole Immorlica.
Emergence of Scaling in Random Networks Albert-Laszlo Barabsi & Reka Albert.
Analysis and Modeling of Social Networks Foudalis Ilias.
Week 5 - Models of Complex Networks I Dr. Anthony Bonato Ryerson University AM8002 Fall 2014.
Information Networks Generative processes for Power Laws and Scale-Free networks Lecture 4.
Synopsis of “Emergence of Scaling in Random Networks”* *Albert-Laszlo Barabasi and Reka Albert, Science, Vol 286, 15 October 1999 Presentation for ENGS.
Models of Network Formation Networked Life NETS 112 Fall 2013 Prof. Michael Kearns.
SILVIO LATTANZI, D. SIVAKUMAR Affiliation Networks Presented By: Aditi Bhatnagar Under the guidance of: Augustin Chaintreau.
Information Networks Small World Networks Lecture 5.
Lecture 7 CS 728 Searchable Networks. Errata: Differences between Copying and Preferential Attachment In generative model: let p k be fraction of nodes.
CSE 522 – Algorithmic and Economic Aspects of the Internet Instructors: Nicole Immorlica Mohammad Mahdian.
1 A Random-Surfer Web-Graph Model (Joint work with Avrim Blum & Hubert Chan) Mugizi Rwebangira.
On the Spread of Viruses on the Internet Noam Berger Joint work with C. Borgs, J.T. Chayes and A. Saberi.
Networks. Graphs (undirected, unweighted) has a set of vertices V has a set of undirected, unweighted edges E graph G = (V, E), where.
Scale-free networks Péter Kómár Statistical physics seminar 07/10/2008.
Entropy Rates of a Stochastic Process
The Barabási-Albert [BA] model (1999) ER Model Look at the distribution of degrees ER ModelWS Model actorspower grid www The probability of finding a highly.
Alon Arad Alon Arad Hurst Exponent of Complex Networks.
Reduction Techniques Restriction Local Replacement Component Design Examples.
Building Low-Diameter P2P Networks Eli Upfal Department of Computer Science Brown University Joint work with Gopal Pandurangan and Prabhakar Raghavan.
Advanced Topics in Data Mining Special focus: Social Networks.
SDSC, skitter (July 1998) A random graph model for massive graphs William Aiello Fan Chung Graham Lincoln Lu.
1 Algorithms for Large Data Sets Ziv Bar-Yossef Lecture 7 May 14, 2006
Network analysis and applications Sushmita Roy BMI/CS 576 Dec 2 nd, 2014.
On Distinguishing between Internet Power Law B Bu and Towsley Infocom 2002 Presented by.
Decomposing Networks and Polya Urns with the Power of Choice Joint work with Christos Amanatidis, Richard Karp, Christos Papadimitriou, Martha Sideri Presented.
Random Graph Models of Social Networks Paper Authors: M.E. Newman, D.J. Watts, S.H. Strogatz Presentation presented by Jessie Riposo.
Large-scale organization of metabolic networks Jeong et al. CS 466 Saurabh Sinha.
The Erdös-Rényi models
Information Networks Power Laws and Network Models Lecture 3.
Topic 13 Network Models Credits: C. Faloutsos and J. Leskovec Tutorial
2 Introduction: phase transition phenomena Phase transition: qualitative change as a parameter crosses threshold Matter temperature magnetism demagnetism.
Author: M.E.J. Newman Presenter: Guoliang Liu Date:5/4/2012.
Clustering of protein networks: Graph theory and terminology Scale-free architecture Modularity Robustness Reading: Barabasi and Oltvai 2004, Milo et al.
Edge-disjoint induced subgraphs with given minimum degree Raphael Yuster 2012.
Techniques for Proving NP-Completeness Show that a special case of the problem you are interested in is NP- complete. For example: The problem of finding.
Percolation Processes Rajmohan Rajaraman Northeastern University, Boston May 2012 Chennai Network Optimization WorkshopPercolation Processes1.
Class 9: Barabasi-Albert Model-Part I
Lecture 10: Network models CS 765: Complex Networks Slides are modified from Networks: Theory and Application by Lada Adamic.
1 Use graphs and not pure logic Variables represented by nodes and dependencies by edges. Common in our language: “threads of thoughts”, “lines of reasoning”,
Miniconference on the Mathematics of Computation
Models and Algorithms for Complex Networks
Theory of Computational Complexity Probability and Computing Chapter Hikaru Inada Iwama and Ito lab M1.
Network (graph) Models
Hiroki Sayama NECSI Summer School 2008 Week 2: Complex Systems Modeling and Networks Network Models Hiroki Sayama
Stochastic Streams: Sample Complexity vs. Space Complexity
Sequential Algorithms for Generating Random Graphs
Topics In Social Computing (67810)
Bart M. P. Jansen June 3rd 2016, Algorithms for Optimization Problems
Models of Network Formation
Models of Network Formation
Models of Network Formation
Models of Network Formation
Modelling and Searching Networks Lecture 5 – Random graphs
Modelling and Searching Networks Lecture 6 – PA models
Network Science: A Short Introduction i3 Workshop
Discrete Mathematics and its Applications Lecture 5 – Random graphs
Network Models Michael Goodrich Some slides adapted from:
Advanced Topics in Data Mining Special focus: Social Networks
Discrete Mathematics and its Applications Lecture 6 – PA models
Advanced Topics in Data Mining Special focus: Social Networks
Locality In Distributed Graph Algorithms
Presentation transcript:

The Theory of Zeta Graphs with an Application to Random Networks Christopher Ré Stanford

Social Network Data Social network data is ubiquitous and high value. Since 2000, many studies of the dynamics of these graphs, Watts-Strogatz, Preferential Attachment, etc. Design new random graph models to capture some new aspect of an observed network. Above is not the goal of this work…

What’s the matter with Erdös-Rényi? G(N,p) does not match real-world graphs (degree distribution, diameter) G(N,p) does not match real-world graphs (degree distribution, diameter) But we have a beautiful theory of G(N,p) (zero-one laws, the “movie”, threshold phenomenon, ….) Much of this work enabled by simple, declarative G(N,p). Find an ER-like model to replace generative models for DB theory-style theorems? May lead to rigorous hypothesis testing for these models (key question in motifs).

Which model should we study? “At each time step, a new vertex is added. Then, with probability δ, two vertices are chosen uniformly at random and joined by an undirected edge.” – CHKNS Many models. For this study: simple & popular. Callway, Hopcroft, Kleinberg, Newman, Strogatz (CHKNS) CHKNS captures one salient aspect of many models: Arrival order of node affect its properties. NB: Does not capture all phenomenon of interest.

Zeta Graphs Simple model to capture “arrival order” NB: We’ll use a directed variant, all queries are binary since its easier to describe.

Zeta graphs Bare bones model to break symmetry: 1 connects to many nodes (~ log N). N connects to 1 node (in expectation) Bare bones model to break symmetry: 1 connects to many nodes (~ log N). N connects to 1 node (in expectation) ER-like: Edges are present independently. Zeta graphs are a family of sets of graphs indexed by N Fixed node set: [N] = {1,…,N} (Index ≈ arrival order) Stochastic edge set (independent edges)

Informal Main Result Conjunctive Graph Queries cannot distinguish between Zeta graphs and CHKNS as N to ∞. 1. Determine the Theory of Zeta Graphs 2. Show the Theory of CHKNS is sandwiched between two “slices” of Zeta Graphs. Here, Theory is set of CQs with probability 1

1 st Technical Challenge: Graph Patterns

Our goal for this section Given (1) a Language of Boolean queries L, and (2) a family of probability models M(1), M(2), …,M(N) check if lim N to ∞ Pr M(N) [q] = 1 for q in L Given (1) a Language of Boolean queries L, and (2) a family of probability models M(1), M(2), …,M(N) check if lim N to ∞ Pr M(N) [q] = 1 for q in L For the talk: (1)L will be “graph patterns” positive conjunctive queries over binary relations. (2)The family of probability models M(N)= “Theory” Th(L,M) = { q in L : lim N to ∞ Pr M(N) [q] = 1 }

Boolean Query Answering on ER Graphs (2) Compute expected number of tuples. (1) Form “full query” corresponding to q. (3) Use Janson’s Inequality to relate E[Q] to Pr[q]

Recall: Classical Janson’s Inequality A classical sufficient condition for Pr[q] to 1. A Q(c) and Q(d) properly overlap if they are not identical, but they share at least one identical subgoal see Alon & Spencer, Random Graphs A corollary of Janson’s inequality is:

Boolean Query Answering on ER Graphs (2) Compute expected number of tuples. (1) Form “full query” corresponding to q. (3) Use Janson’s Inequality to relate E[Q] to Pr[q] What changes for Zeta graphs?

Computing Expectation

Multiple Valued Zeta (MVZ) Functions Only use integer s i in this talk MVZs show up in some strange places…

Order Matters: Paths of Length 2 If x < y < z If x < z < y So in our “atoms” variables will be totally ordered

Why Multiple-Valued Zeta (MVZ)? Well-studied special function. We get for free: 1.Asymptotics [Costermans et al. 2005] 2.Algebraic Identities [Zudilin & Zudilin 2003] 3.Fancy sounding function (not helpful)

Asymptotic Estimates for MVZs This is a small variation of Costermans et al. result. (expected # of edges) (expected # of triangles) (expected # of K 4 )

Indicates shared identical goal Pr[2 Paths] Consider pairs of properly overlapping 2 paths. And others o(E[Q] 2 ) and since E[Q] =  (1), Pr[Q] = 1 – o(1) …

Two cycles you’re out! r cycle r cycle s cycle s cycle (1) For all r, s ≥ 2, Pr M(N) [ B(r,s) ] 0 as N to ∞, i.e., no bicycles. B(r,s) (2) Any connected graph q with at most one cycle appears with probability 1. 1 st result: Two Parts: (A) Any individual pattern, check E, and (B) Different “orderings” are non-negatively correlated.

Back to CHKNS

Central Message How different is CHKNS from the family of Zeta graphs? Up to CQs, the answer is not at all.

Key Technical Issues 1. CHKNS Edge probabilities have a painful form. – But can be sandwiched by “Zeta slices” 2. CHKNS Edges are correlated! - Develop bounds on correlations 3. Show that CHKNS can be essentially embedded in a part of Zeta graphs. Goal: Establish that Th(“Graph Patterns”, CHKNS ) = Th(“Graph Patterns”, Zeta Graphs)

Other Related Work Graph Models. Huge amounts. Volumes! [Lynch 05]: Conditions on a skewed degree distribution, but symmetrizes labels. Proves a 0-1 law for all of FO! Zeta graphs and CHKNS have no 0-1 law. Inspired by this paper!

Future Work & Conclusion “Conjunctive” theory of simple random graph models with order. Does a simpler model capture CHKNS? Could one capture Albert & Barabasi’s preferential attachment model? Richer Languages?

Expectations for Ordered Graphs Since sensitive to order, consider graph patterns with order among variables. Then expectation has a semi-closed form. This function has an MVZ

Computing Expectations of General CQs If variables in Q are totally ordered, then we can compute E[Q] using MVZs. Obvious algorithm: given a query, add in equality and inequality in all possible ways. This takes exponential time in Q (#P-hard).