Estimating Networks With Jumps

Estimating Networks With Jumps
Mladen Kolar ( ) Carnegie Mellon University With Eric P. Xing

The Internet [wikipedia]

Twitter

Biological regulatory network

Metabolic network In this image we show the integrated metabolic and originated physical enzyme-enzyme interaction network of the Mycobacterium tuberculosis. We created them separately and then integrated it into a single network. Nodes are the enzymes catalyzing reactions, the red directed edges are the metabolic pathway and the blue undirected edges are predicted physical interactions. [Daniel Banky, ISMB09]

Networks are ubiquitous in sciences
… and nowadays worldwide

Networks are mathematical abstractions of complex systems
Networks are useful for visualization discovery of regularity patterns exploratory analysis ... of complex systems.

Current practices in network modeling
Probabilistic graphical models used for exploring networks Ising models Gaussian graphical models contains both structure and parameters

Interpretation of Markov Random Fields
A network can be obtained by drawing links between nodes that are conditionally independent.

High dimensional inference
Applications in many domains have large number of variables and small number of observations To avoid curse of dimensionality the models are assumed to have a low-dimensional structure for example, a small number of non-zero parameters – sparsity of the precision matrix

Rich literature on estimation of MRFs
Yuan & Lin, 2006; Meinshausen and Buhlmann, 2006; d’Aspremont et al., 2007; Bickel & Levina, 2007; El Karoui, 2007; Rothman et al., 2007; Zhou et al., 2007; Friedman et al., 2008; Lam & Fan, 2008; Ravikumar et al., 2008; Zhou, Cai & Huang, 2009; Peng et al. 2009; Guo et al., 2010; …

Estimating sparse Gaussian MRF models
Penalized maximum likelihood approach Neighborhood selection

Relating Neighborhood Selection to Linear regression
Partial correlation represents correlation between two variables conditioned on the rest

Graph Regression Neighborhood selection

Graph Regression

Drawbacks of current approaches
Many systems of interest cannot be explained by one static network model. There is a need for models that explain dynamical systems.

Estimating Time-Varying Networks

Networks with jumps

Temporally Smoothed Graph Regression (TESLA)
[Ahmed and Xing, 2009] …

Improved Estimation Procedure
Estimation of neighborhood of a node Loss Penalty

The structure of the penalty
Structural changes Sparsity

Optimization Convex problem
Non-smooth penalty term presents difficulties Smoothing technique (Nesterov, 2005) Smooth approximation of

Optimization (II) Accelerated gradient applied to the smoothed problem
iteratively solves It can be shown that the algorithm converges as

Tuning parameter selection
Optimizing the Bayesian information criterion

Simulation results (Chain)

Simulation results (NN)

Definition of the model
The model where defines a block with being block boundaries

Assumptions A1 There exist two constants and such that A2 Variables are scaled so that

Assumptions (II) A3 There exists a constant A4 There exists a constant

Assumptions (III) A5 The sequence of partition boundaries satisfy , where is a fixed, unknown sequence of the boundary fractions belonging to [0, 1].

Consistent estimation of fraction boundaries
Under A1 – A5, with the number of blocks known, we can show that with for some , if the following holds Here the minimal size of the jump measured as

Proof strategy The proof hinges on the analysis of the optimality conditions where and We show that events occur with low probability, otherwise the optimality conditions cannot be satisfied.

Consistent estimation of fraction boundaries (II)
Under the same regularity conditions, with an upper bound on the number of blocks known The distance h(., .) is defined as

Structural consistency
We have shown that the partition boundaries can be estimated consistently. Under the same regularity condition we can further show that for blocks that have “enough” samples

Proof strategy The proof uses techniques developed in Meinshausen and Buhlmann, 2006; Peng et al and Wainwright 2009 The main difficulty that differs from the existing work is controlling the bias that arises from estimating the partition boundaries.

Discussion Confidence on the estimated networks Stability Applications to streaming data and online learning

Thank you!

Estimating Networks With Jumps

Similar presentations

Presentation on theme: "Estimating Networks With Jumps"— Presentation transcript:

Similar presentations

About project

Feedback

Log in

Auth with social network:

Estimating Networks With Jumps

Similar presentations

Presentation on theme: "Estimating Networks With Jumps"— Presentation transcript:

Similar presentations

About project

Feedback