Statistical Data Analysis: II

Slides:



Advertisements
Similar presentations
Data-Assimilation Research Centre
Advertisements

Ch2. Elements of Ensemble Theory
Lecture 13: Conformational Sampling: MC and MD Dr. Ronald M. Levy Contributions from Mike Andrec and Daniel Weinstock Statistical Thermodynamics.
Monte Carlo Methods and Statistical Physics
Lecture 3: Markov processes, master equation
Lecture 14: Advanced Conformational Sampling
Outline Formulation of Filtering Problem General Conditions for Filtering Equation Filtering Model for Reflecting Diffusions Wong-Zakai Approximation.
MAE 552 – Heuristic Optimization Lecture 6 February 6, 2002.
Stochastic Differential Equations Langevin equations Fokker Planck equations Equilibrium distributions correlation functions Purely dissipative Langevin.
Symmetry. Phase Continuity Phase density is conserved by Liouville’s theorem.  Distribution function D  Points as ensemble members Consider as a fluid.
Mechanics.
Course AE4-T40 Lecture 5: Control Apllication
The Monte Carlo Method: an Introduction Detlev Reiter Research Centre Jülich (FZJ) D Jülich
Monte Carlo Methods in Partial Differential Equations.
Bioinf. Data Analysis & Tools Molecular Simulations & Sampling Techniques117 Jan 2006 Bioinformatics Data Analysis & Tools Molecular simulations & sampling.
Advanced methods of molecular dynamics Monte Carlo methods
HYDRODYNAMIC MODES AND NONEQUILIBRIUM STEADY STATES Pierre GASPARD Brussels, Belgium J. R. Dorfman, College Park S. Tasaki, Tokyo T. Gilbert, Brussels.
F.F. Assaad. MPI-Stuttgart. Universität-Stuttgart Numerical approaches to the correlated electron problem: Quantum Monte Carlo.  The Monte.
Basic Monte Carlo (chapter 3) Algorithm Detailed Balance Other points.
Module 1: Statistical Issues in Micro simulation Paul Sousa.
1 CE 530 Molecular Simulation Lecture 6 David A. Kofke Department of Chemical Engineering SUNY Buffalo
1 Lesson 8: Basic Monte Carlo integration We begin the 2 nd phase of our course: Study of general mathematics of MC We begin the 2 nd phase of our course:
P T A typical experiment in a real (not virtual) space 1.Some material is put in a container at fixed T & P. 2.The material is in a thermal fluctuation,
Nonlinear localization of light in disordered optical fiber arrays
The Ideal Monatomic Gas. Canonical ensemble: N, V, T 2.
Part I Convergence Results Andrew Stuart John Terry Paul Tupper R.K
1 IEE5668 Noise and Fluctuations Prof. Ming-Jer Chen Dept. Electronics Engineering National Chiao-Tung University 03/11/2015 Lecture 3: Mathematics and.
Dissipative Particle Dynamics. Molecular Dynamics, why slow? MD solves Newton’s equations of motion for atoms/molecules: Why MD is slow?
Time-dependent Schrodinger Equation Numerical solution of the time-independent equation is straightforward constant energy solutions do not require us.
Advanced methods of molecular dynamics 1.Monte Carlo methods 2.Free energy calculations 3.Ab initio molecular dynamics 4.Quantum molecular dynamics 5.Trajectory.
Exploring the connection between sampling problems in Bayesian inference and statistical mechanics Andrew Pohorille NASA-Ames Research Center.
Lecture 20. Continuous Spectrum, the Density of States (Ch. 7), and Equipartition (Ch. 6) The units of g(  ): (energy) -1 Typically, it’s easier to work.
ارتباطات داده (883-40) فرآیندهای تصادفی نیمسال دوّم افشین همّت یار دانشکده مهندسی کامپیوتر 1.
Lecture 2 Molecular dynamics simulates a system by numerically following the path of all particles in phase space as a function of time the time T must.
1/18/2016Atomic Scale Simulation1 Definition of Simulation What is a simulation? –It has an internal state “S” In classical mechanics, the state = positions.
Discrete-time Random Signals
Colorado Center for Astrodynamics Research The University of Colorado 1 STATISTICAL ORBIT DETERMINATION Kalman Filter with Process Noise Gauss- Markov.
Basic Monte Carlo (chapter 3) Algorithm Detailed Balance Other points non-Boltzmann sampling.
Lecture 14: Advanced Conformational Sampling Dr. Ronald M. Levy Statistical Thermodynamics.
6/11/2016Atomic Scale Simulation1 Definition of Simulation What is a simulation? –It has an internal state “S” In classical mechanics, the state = positions.
Chapter 6 Random Processes
STATISTICAL MECHANICS PD Dr. Christian Holm PART 5-6 Some special topics, Thermal Radiation, and Plank distribution.
The Monte Carlo Method/ Markov Chains/ Metropolitan Algorithm from sec in “Adaptive Cooperative Systems” -summarized by Jinsan Yang.
Numerical Solutions to the Diffusion Equation
Simulations Patrice Koehl Department of Biological Sciences
Lesson 8: Basic Monte Carlo integration
Wiener Processes and Itô’s Lemma
Q. M. Particle Superposition of Momentum Eigenstates Partially localized Wave Packet Photon – Electron Photon wave packet description of light same.
Diffusion over potential barriers with colored noise
12. Principles of Parameter Estimation
Overview of Molecular Dynamics Simulation Theory
SIGNALS PROCESSING AND ANALYSIS
The units of g(): (energy)-1
Outline Introduction Signal, random variable, random process and spectra Analog modulation Analog to digital conversion Digital transmission through baseband.
PSG College of Technology
Boltzmann statistics Reservoir R U0 -  Combined system U0 = const
Markov chain monte carlo
Random WALK, BROWNIAN MOTION and SDEs
Quantum Two.
Advanced Molecular Dynamics
Biointelligence Laboratory, Seoul National University
Learning From Observed Data
Common Types of Simulations
Chapter 6 Random Processes
9. Two Functions of Two Random Variables
12. Principles of Parameter Estimation
Accelerator Physics Statistical Effects
Lesson 9: Basic Monte Carlo integration
Accelerator Physics G. A. Krafft, A. Bogacz, and H. Sayed
Accelerator Physics G. A. Krafft, A. Bogacz, and H. Sayed
Presentation transcript:

Statistical Data Analysis: II PART II 1. From trajectories to Probability Distribution Functions (PDF) 2. Sampling PDF with Particle Dynamics

Sampling

Sampling PDF’s with Particle Dynamics The ergodic theorem highlights an equivalence/duality between the Trajectory (Dynamics) versus PDF descriptions (Statistics). The question is how to sample a PDF using trajectories, i.e. Particle Dynamics. Since this is a major theme of Statistical Mechanics and Stochastic Optimization (search in hyperdimensional spaces), next we dig a bit deeper into this matter.

What is sampling? A sequence of numbers whose frequency reflects the given PDF with frequencies: is drawn times on a total of Such that:

Monte Carlo Sampling 1. Draw at random in X space 2. Compute 3. Draw a random 4. If : Accept Else: Reject and try again Repeat 1-4 until N acceptances Q: How about efficiency? N/Ntry can be very poor <10%

Dynamic Sampling (Trajectories) Generate a discrete trajectory Such that the time spent “around” Forms a sequence Such that where Plus: All values are accepted! Minus: The trajectory should visit all target space X

The Dirac Distribution By convoluting with a given function, picks up a single value Namely:

Dirac Transformation Rules By convoluting with a given function, picks up a single value Corresponding to zeros of the Dirac argument (crossing points): T1: More complicated: T2: Why? Because:

Dirac transformation rule Given the transformation The Dirac distribution transforms accordingly to: Namely: Useful identity:

Dynamical systems Consider a first-order equation of motion: With initial conditions: This is a mapping from x0 to x parametrized by time Solved by: The PDF generated by the single trajectory with Initial Condition x0 is: Dirac score Each crossing at x scores 1. Based on Dirac Rule T2:

Dynamical systems The corresponding PDF generated out of an ensemble of Initial Conditions x0 is: Dirac score:

Dynamic sampling via stochastic dynamics By now, we have learned how to sample using Deterministic trajectories . Since initial conditions matter, we may need MANY initial conditions, or a VERY LONG single trajectory (if the system is ergodic) Both can be inefficient. How about using stochastic dynamics?

PDF evolution The probability of finding a particle starting at x0 at t=t0 at position x at time t, is given by: By averaging over initial conditions: What is the evolution equation of this PDF? Therefore P obeys a continuity equation: Steady-state: As expected!

Stochastic Particle Dynamics We add noise: Stochastic Particle Dynamics Noise obeys Fluctuation-Dissipation Theorem: Each noise realization gives rise to its own trajectory. No need to change the initial condition. The formal solution is (Stratonovich):

Averaging over noise One can therefore define p(x;t) by integrating over both Initial conditions AND/OR Noise Distribution (typically gaussian). Noise can mitigate/eliminate averaging over initial conditions Differential Ito calculus: The mean is unaffected: The variance picks up a diffusive component:

Fokker-Planck equation Therefore P obeys now a continuity-diffusion equation, (Fokker-Planck Equation): Stationary (Equilibrium) distribution: Where: and Is the Partition Function Each of the PDF discussed so far can thus be linked to a dynamic stochastic process each with its own potential V(x) and diffusion D (temperature)!

Thermodynamic Partition Function For a thermodynamic system, x is 6N-dimensional vector, with N order Avogadro! Computing the PF is generally impossible other than for Gaussian ensembles (quadratic potential): Main analytical technique: Saddle Point Expand around local minima and perform the Gaussian integrals Main numerical technique: Metropolis Monte Carlo Perform a random walk in x-space. If the move lowers V(x) It is accepted. If it raises V(x), it is accepted with probability exp(-deltaV), deltaV>0. Also, Stochastic Particle Dynamics.

Molecular Dynamics Hamiltonian eqs of motion Energy = Hamiltonian function Closed orbit in phase-space (x,p) Each orbit corresponds to a different value of E Microcanonical: Canonical:

Sampling PDF’s with Molecular Dynamics If the system is ergodic, a single trajectory of span T is equivalent to M replicas of span T/M The deterministic particle dynamics is not effective in visiting large portions of phase space. Many initial conditions may be needed…

Sampling PDF’s with Langevin Dynamics If the system is ergodic, a single trajectory of span T Is equivalent to M replicas of span T/M The stochastic Langevin Dynamics is more effective In visiting larger portions of phase space

Which one is best suited to what? PDF versus SDE We are presented with a dual, equivalent representation of statistical phenomena: Statistical PDF versus Stochastic Particle Dynamics (SPD). Which one is best suited to what? SPD is more direct, it provides the detailed dynamics leading to the statistical distributions. However it entails heavy statistical averaging. Capturing rare events can be excruciating. PDF’s give directly the noise-free statistical distribution. However one must solve the FPE, which is generally less straightforward, especially as the number of dimensions increases. In d>3-6, SPD is the only option (Molecular Dynamics and/or MonteCarlo methods)

Appendix: A note on ergodicity Ergodic systems visit their entire phase-space X, so that the time spent on a given region R within X is proportional to the Size of that region (“measure” in math language) Ergodic Ergodicity is often assumed but hardly proven on rigorous grounds. Notable exception: Sinai Billiard

Broken-Ergodicity Ergodicity can be broken by teh existence of Additional invariants of motion, besides energy Broken-ergodicity, the white region is dynamically unaccessible Because it implies a change in the invariant Ensemble average: Many complex systems show broken ergodicity: Glasses, proteins…. Usually associated with highly corrugated free-energy landscapes Historically, broken ergodicity was first observed in the famous Fermi-Pasta-Ulam-Tsingou problem (see pdf file)

Ergodic Theorem E R Time average of a given observable A(x): G O N Ensemble average: Brackets denote average over Initial Conditions. First order dynamical system:

Assignements 1. Sample the Gaussian distribution with the first order stochastic dynamics xdot=-k*x + noise (Langevin equation) 2. Optional for the enthusiast: Compute the PDF generated by an harmonic oscillator by performing ensemble averaging over a set of initial conditions (positions and velocities) with the same energy E (micro-canonical ensemble). Hint: you must bin (x,p) phase space…

End of Lecture