FLUCTATION SCALING: TAYLOR’S LAW AND BEYOND János Kertész Budapest University of Technology and Economics.

Slides:



Advertisements
Similar presentations
Mean, Proportion, CLT Bootstrap
Advertisements

© 2003 Prentice-Hall, Inc.Chap 5-1 Business Statistics: A First Course (3 rd Edition) Chapter 5 Probability Distributions.
Information Networks Generative processes for Power Laws and Scale-Free networks Lecture 4.
The Global Digital Elevation Model (GTOPO30) of Great Basin Location: latitude 38  15’ to 42  N, longitude 118  30’ to 115  30’ W Grid size: 925 m.
STAT 497 APPLIED TIME SERIES ANALYSIS
ELEC 303 – Random Signals Lecture 18 – Statistics, Confidence Intervals Dr. Farinaz Koushanfar ECE Dept., Rice University Nov 10, 2009.
Weighted networks: analysis, modeling A. Barrat, LPT, Université Paris-Sud, France M. Barthélemy (CEA, France) R. Pastor-Satorras (Barcelona, Spain) A.
1 Stochastic Event Capture Using Mobile Sensors Subject to a Quality Metric Nabhendra Bisnik, Alhussein A. Abouzeid, and Volkan Isler Rensselaer Polytechnic.
Aims: - evaluate typical properties in controlled model situations - gain general insights into machine learning problems - compare algorithms in controlled.
Longitudinal Experiments Larry V. Hedges Northwestern University Prepared for the IES Summer Research Training Institute July 28, 2010.
Chapter 10 Simple Regression.
1 17. Long Term Trends and Hurst Phenomena From ancient times the Nile river region has been known for its peculiar long-term behavior: long periods of.
Peer-to-Peer and Grid Computing Exercise Session 3 (TUD Student Use Only) ‏
Financial Networks with Static and dynamic thresholds Tian Qiu Nanchang Hangkong University.
On the Constancy of Internet Path Properties Yin Zhang, Nick Duffield AT&T Labs Vern Paxson, Scott Shenker ACIRI Internet Measurement Workshop 2001 Presented.
Chapter 7 Sampling and Sampling Distributions
Multi-Scale Analysis for Network Traffic Prediction and Anomaly Detection Ling Huang Joint work with Anthony Joseph and Nina Taft January, 2005.
Lec 6, Ch.5, pp90-105: Statistics (Objectives) Understand basic principles of statistics through reading these pages, especially… Know well about the normal.
Analysis of Social Information Networks Thursday January 27 th, Lecture 3: Popularity-Power law 1.
1 Algorithms for Large Data Sets Ziv Bar-Yossef Lecture 7 May 14, 2006
Self-Similar through High-Variability: Statistical Analysis of Ethernet LAN Traffic at the Source Level Walter Willinger, Murad S. Taqqu, Robert Sherman,
Ka-fu Wong © 2004 ECON1003: Analysis of Economic Data Lesson6-1 Lesson 6: Sampling Methods and the Central Limit Theorem.
Regression Model Building Setting: Possibly a large set of predictor variables (including interactions). Goal: Fit a parsimonious model that explains variation.
Objectives (BPS chapter 11) Sampling distributions  Parameter versus statistic  The law of large numbers  What is a sampling distribution?  The sampling.
Hydrologic Statistics
Information Networks Power Laws and Network Models Lecture 3.
Objectives (BPS chapter 11) Sampling distributions  Parameter versus statistic  The law of large numbers  What is a sampling distribution?  The sampling.
Sampling distributions BPS chapter 11 © 2006 W. H. Freeman and Company.
Claudio Castellano CNR-INFM Statistical Mechanics and Complexity and
Dynamics of Coupled Oscillators-Theory and Applications
Models and Algorithms for Complex Networks Power laws and generative processes.
Separating internal and external dynamics of complex systems Marcio Argollo de Menezes Albert-László Barabási.
How and why things crackle We expect that there ought to be a simple, underlying reason that earthquakes occur on all different sizes. The very small earthquake.
Expanders via Random Spanning Trees R 許榮財 R 黃佳婷 R 黃怡嘉.
Critical Phenomena in Random and Complex Systems Capri September 9-12, 2014 Spin Glass Dynamics at the Mesoscale Samaresh Guchhait* and Raymond L. Orbach**
Weighted networks: analysis, modeling A. Barrat, LPT, Université Paris-Sud, France M. Barthélemy (CEA, France) R. Pastor-Satorras (Barcelona, Spain) A.
Sampling distributions BPS chapter 11 © 2006 W. H. Freeman and Company.
Initial flux fluctuations of random walks on regular lattices Sooyeon YOON, Byoung-sun AHN, Yup KIM (Dept. of Phys.,Kyung Hee Univ.)
Complexity in Fisheries Ecosystems David Schneider Ocean Sciences Centre, Memorial University St. John’s, Canada ENVS 6202 – 26 Sept 2007.
PCB 3043L - General Ecology Data Analysis. OUTLINE Organizing an ecological study Basic sampling terminology Statistical analysis of data –Why use statistics?
Week11 Parameter, Statistic and Random Samples A parameter is a number that describes the population. It is a fixed number, but in practice we do not know.
1 Chapter 7 Sampling Distributions. 2 Chapter Outline  Selecting A Sample  Point Estimation  Introduction to Sampling Distributions  Sampling Distribution.
ELEC 303 – Random Signals Lecture 18 – Classical Statistical Inference, Dr. Farinaz Koushanfar ECE Dept., Rice University Nov 4, 2010.
K. Ensor, STAT Spring 2005 Model selection/diagnostics Akaike’s Information Criterion (AIC) –A measure of fit plus a penalty term for the number.
Validating an Access Cost Model for Wide Area Applications Louiqa Raschid University of Maryland CoopIS 2001 Co-authors V. Zadorozhny, T. Zhan and L. Bright.
Workshop on Optimization in Complex Networks, CNLS, LANL (19-22 June 2006) Application of replica method to scale-free networks: Spectral density and spin-glass.
1 Jeffery T. Mitchell – Quark Matter /17/12 The RHIC Beam Energy Scan Program: Results from the PHENIX Experiment Jeffery T. Mitchell Brookhaven.
PCB 3043L - General Ecology Data Analysis. PCB 3043L - General Ecology Data Analysis.
PCB 3043L - General Ecology Data Analysis.
Asymptotic behaviour of blinking (stochastically switched) dynamical systems Vladimir Belykh Mathematics Department Volga State Academy Nizhny Novgorod.
Chapter 5 Sampling Distributions. Introduction Distribution of a Sample Statistic: The probability distribution of a sample statistic obtained from a.
ECE-7000: Nonlinear Dynamical Systems 2. Linear tools and general considerations 2.1 Stationarity and sampling - In principle, the more a scientific measurement.
Lecture 8: Measurement Errors 1. Objectives List some sources of measurement errors. Classify measurement errors into systematic and random errors. Study.
Testbeam analysis Lesya Shchutska. 2 beam telescope ECAL trigger  Prototype: short bars (3×7.35×114 mm 3 ), W absorber, 21 layer, 18 X 0  Readout: Signal.
PCB 3043L - General Ecology Data Analysis Organizing an ecological study What is the aim of the study? What is the main question being asked? What are.
Networks interacting with matter Bartlomiej Waclaw Jagellonian University, Poland and Universität Leipzig, Germany.
Random Walk for Similarity Testing in Complex Networks
Link-Level Internet Structures
Forecasting Methods Dr. T. T. Kachwala.
Outline Introduction Signal, random variable, random process and spectra Analog modulation Analog to digital conversion Digital transmission through baseband.
Chapter 7: Sampling Distributions
Review of Hypothesis Testing
Baosheng Yuan and Kan Chen
Sampling distributions
Statistics of Extreme Fluctuations in Task Completion Landscapes
Volume 111, Issue 2, Pages (July 2016)
Managing Flow Variability: Safety Inventory
Andreas Hilfinger, Thomas M. Norman, Johan Paulsson  Cell Systems 
复杂网络可控性 研究进展 汪秉宏 2014 北京 网络科学论坛.
Review of Hypothesis Testing
Presentation transcript:

FLUCTATION SCALING: TAYLOR’S LAW AND BEYOND János Kertész Budapest University of Technology and Economics

OUTLINE General observation: Fluctuation Scaling (FS) is ubiquitous in complex systems Examples from population dynamics, internet traffic, stock market, etc. Categorization: Temporal and ensemble FS Interesting effects: Window dependence, multiscaling Possible scenarios: - Central Limit Theorems - Strong driving - Random Walk with impact - Finite Size Scaling Summary

GENERAL OBSERVATION: FLUCTATION SCALING 1938 Fairfield Smith: For fixed size A of lands the average yield and the variance was measured. Varying A, the two quantities show scaling: with 1961 L.R. Taylor ( ) in Nature: ”Aggregation, variance and the mean”. Counted # of animals in given areas, and stated that (1) is a ”universal law” (today called Taylor’s law in population dynamics). Triggered more than 1000 studies. Widely accepted as one of the few universal laws in ecology. (1)

Fluctuation scaling for ensemble averages of the population of four species. Every point represents the mean and variance over an ensemble of areas of the same size A. The bottom dashed line corresponds to = 1/2, the top one to = 1. Points were shifted both vertically and horizontally for better visibility. Data from Taylor (1961).

Fluctuation scaling (FS): In a complex system the activity and its variance are related by a power law. Activity can be a global extensive characteristic (# animals in area). In this case systems with different parameters (e.g., area) are compared and ensemble averages are taken for each value of the parameter: Ensemble Fluctuation Scaling (EFS) Trivial example from physics: # of atoms in volume V. Further examples: # cells and its fluctuations of given organs in different species. Scaling over 10 orders of magnitude, ~ 1. # of tumor cells Human genome SNP-s (Single Nucleotide Polymorphisms)

Temporal Fluctuation Scaling Consider population time series in different habitats. The average and the varience of the time series scale like

Usually processes take place on networks –Internet –traffic networks –stock market Coupling to external world/drive present Multichanel observation: Activity measured on the i-th node (or link) : f i (t) Multichannel observations in complex systems

Highways f i (t) =traffic at a given point of a road i at day t. Daily traffic on 127 Colorado roads from 1998 to Computer chip f i (t) =state of a given logic component i at clock cycle t. 462 signal carriers 8,862 clock cycles. M. de Menezes and A.-L. Barabási,

Internet f i (t) = number of bytes passing through router i at time t. 347 routers t max =2 days (5 min. resolution) World Wide Web f i (t) = number of visits to website i at day t 3000 web sites. Daily visitation for a 30 day period M. de Menezes and A.-L. Barabási

1)For all i nodes: What can we learn from this? 2) Plotted results:

 i ~   = 1/2  = 1 Internetchip WWWhighway The scaling of fluctuations * M. de Menezes and A.-L. Barabási

Two universality classes? Simple random walk model: Network with broad degree-distribution N random walkers with finite lifetime and constant average number Activity at node i: Number of walkers in  t Control the fluctuations in N

Two universality classes? We have seen many for the ensemble averages. What about the time averages? Stock market data: Take a window of size  t=10min, and consider the volume of a stock i traded during this time activity Non-universal scaling over 6 orders of magnitude Eisler et al. 2005

Dependence on  t Duch and Arenas got  = 0.75 for Inernet * Note:

Multiscaling If in the scaling form  depends on q, we have multiscaling (stock market):

T: time average E: ensemble average

Many questions: What is special about  = 1/2 and 1? Are there universality classes? How to relate  t dependence to other types of scaling? How to relate time and ensemble averages? What are the possible scenarios? When is multiscaling expected? What is its origin? What is the ”physical” meaning of the crossovers? Corrections to scaling? …

 = 1/2 Simple case: Central Limit Theorem variance  sqrt(mean) Examples: Stat. phys. fluctuations, random walkers on a network with broad degree distribution and many more. Another route to  = 1/2: If the signal is 1 or 0 (# s) and  t is short enough such that no multiple events happen, then. E.g., for independent events Scaling possible only if spans many orders of magnitudes  p is small  hence  = 1/2

FS from the Enron database.  T depends on the time window  t and approaches 1/2 when  t goes to 0.

 = 1 I If there is synchronization in the system, mostly due to a strong external drive, the internal fluctuations become irrelevant and the major part of the fluctuations come from the drive itself, and the part of the noise going to the nodes is proportional to the signal at the nodes (which is also triggered by the drive). Simple random walk model: If the number of walkers in the system is fluctuating then a crossover from  = ½ to 1 takes place.

General   = 1/2 and  = 1 are not “universality classes in the stat. phys. sense. They are trivial extremes. Generally: 1/2 ≤  ≤ 1 How to obtain non-trivial  -s? -Impact inhomogeneity -Finite Size Scaling

Impact inhomogeneity Imagine that the random walkers make in impact on the node they visit, which is proportional to the (degree)  of the node. The activity now is the impact during the time window. There are two fluctuating quantities: N and V N From which follows: with limits ~ k 2β+1 ~ f 2  ~ 2  ~ k 2  (β+1)

Finite Size Scaling (FSS) Linear size of the system: L Number of ”spins”: L d Order parameter: M Susceptibility:  ~  2 FSS at criticality Hyperscaling:  =1 What if not M = N up – N down but just N up is looked at? N up = L d is extensive but with fluctuations like in  with 1/2 ≤  ≤ 1, e.g.,  MF = 3/4 SOC?

Binary forest model Consider a forest of N trees. In year t the reproductive activity (seed count) of a tree n is V n For simplicity we take V n = 1 with prob. p and 0 with prob. (1-p) Due to the relative shortness of the observation period N is const. The reproductive activity of forest exhibits long range correlations: From data fit: C(  n) ~ (  n) -0.4

The total reproductivity is From the correlations (for 1d) As the reproductivity is extensive  Correlations increase synchronization

Hurst exponent vs. FS If f i comes from time series,  i may scale with the length of the series as The dependence of  on  t implies a dependence of H on i: which is governed by the same . No universality! (E.g., dependence of H on capitalization.)

Limit theorems FS is always related to sums of random variables. We have seen that  = 1/2 comes from plain CLT In the language of limit theorems FS means Possible reasons to get nontrivial  : iid, but Levy stable distributions dependence of the variables (see, e.g. FSS)

Summary FS: general observation over many disciplines and systems FS: ensemble/temporal 1/2 ≤  ≤ 1 Trivial limiting cases (no universality classes) Scenarios to non-trivial  -s Limit theorems Review by Z. Eisler, I. Bartos and J. Kertesz: Adv. Phys. 57, (2008), arXiv: