Author: B. C. Bromley Presented by: Shuaiyuan Zhou 10-30-2009 Quasi-random Number Generators for Parallel Monte Carlo Algorithms.

Slides:



Advertisements
Similar presentations
Christian Lauterbach COMP 770, 2/16/2009. Overview  Acceleration structures  Spatial hierarchies  Object hierarchies  Interactive Ray Tracing techniques.
Advertisements

An Array-Based Algorithm for Simultaneous Multidimensional Aggregates By Yihong Zhao, Prasad M. Desphande and Jeffrey F. Naughton Presented by Kia Hall.
A NOVEL APPROACH TO SOLVING LARGE-SCALE LINEAR SYSTEMS Ken Habgood, Itamar Arel Department of Electrical Engineering & Computer Science GABRIEL CRAMER.
Fast Algorithms For Hierarchical Range Histogram Constructions
Data and Computer Communications Ninth Edition by William Stallings Chapter 12 – Routing in Switched Data Networks Data and Computer Communications, Ninth.
Development of Parallel Simulator for Wireless WCDMA Network Hong Zhang Communication lab of HUT.
Randomized Algorithms Randomized Algorithms CS648 Lecture 6 Reviewing the last 3 lectures Application of Fingerprinting Techniques 1-dimensional Pattern.
© 2004 Goodrich, Tamassia Quick-Sort     29  9.
Sampling Attila Gyulassy Image Synthesis. Overview Problem Statement Random Number Generators Quasi-Random Number Generation Uniform sampling of Disks,
Reference: Message Passing Fundamentals.
CS 584. Review n Systems of equations and finite element methods are related.
Optimization and evaluation of parallel I/O in BIPS3D parallel irregular application Performance Modelling, Evaluation, and optimization of Parallel and.
1 Tuesday, November 14, 2006 “UNIX was never designed to keep people from doing stupid things, because that policy would also keep them from doing clever.
Steady Aeroelastic Computations to Predict the Flying Shape of Sails Sriram Antony Jameson Dept. of Aeronautics and Astronautics Stanford University First.
October, 1998DARPA / Melamed / Singh1 Parallelization of Search Algorithms for Modeling QTES Processes Joshua Kramer and Santokh Singh Rutgers University.
May 29, Final Presentation Sajib Barua1 Development of a Parallel Fast Fourier Transform Algorithm for Derivative Pricing Using MPI Sajib Barua.
CS 584. Sorting n One of the most common operations n Definition: –Arrange an unordered collection of elements into a monotonically increasing or decreasing.
Chapter 14 Simulation. Monte Carlo Process Statistical Analysis of Simulation Results Verification of the Simulation Model Computer Simulation with Excel.
Fast integration using quasi-random numbers J.Bossert, M.Feindt, U.Kerzel University of Karlsruhe ACAT 05.
CSCI-455/552 Introduction to High Performance Computing Lecture 22.
All-Pairs-Shortest-Paths for Large Graphs on the GPU Gary J Katz 1,2, Joe Kider 1 1 University of Pennsylvania 2 Lockheed Martin IS&GS.
DATA LOCALITY & ITS OPTIMIZATION TECHNIQUES Presented by Preethi Rajaram CSS 548 Introduction to Compilers Professor Carol Zander Fall 2012.
Efficient Pseudo-Random Number Generation for Monte-Carlo Simulations Using GPU Siddhant Mohanty, Subho Shankar Banerjee, Dushyant Goyal, Ajit Mohanty.
Random Number Generators CISC/QCSE 810. What is random? Flip 10 coins: how many do you expect will be heads? Measure 100 people: how are their heights.
Calculating Discrete Logarithms John Hawley Nicolette Nicolosi Ryan Rivard.
1 Miodrag Bolic ARCHITECTURES FOR EFFICIENT IMPLEMENTATION OF PARTICLE FILTERS Department of Electrical and Computer Engineering Stony Brook University.
A Parallelisation Approach for Multi-Resolution Grids Based Upon the Peano Space-Filling Curve Student: Adriana Bocoi Advisor: Dipl.-Inf.Tobias Weinzierl.
Distributed Asynchronous Bellman-Ford Algorithm
A SCALABLE LIBRARY FOR PSEUDORANDOM NUMBER GENERATION ALGORITHM 806: SPRNG.
Chao “Bill” Xie, Victor Bolet, Art Vandenberg Georgia State University, Atlanta, GA 30303, USA February 22/23, 2006 SURA, Washington DC Memory Efficient.
Introduction to MCMC and BUGS. Computational problems More parameters -> even more parameter combinations Exact computation and grid approximation become.
Stochastic Algorithms Some of the fastest known algorithms for certain tasks rely on chance Stochastic/Randomized Algorithms Two common variations – Monte.
MCSL Monte Carlo simulation language Diego Garcia Eita Shuto Yunling Wang Chong Zhai.
RNGs in options pricing Presented by Yu Zhang. Outline Options  What is option?  Kinds of options  Why options? Options pricing Models Monte Carlo.
Publication: Ra Inta, David J. Bowman, and Susan M. Scott. Int. J. Reconfig. Comput. 2012, Article 2 (January 2012), 1 pages. DOI= /2012/ Naveen.
Monte Carlo I Previous lecture Analytical illumination formula This lecture Numerical evaluation of illumination Review random variables and probability.
Experimental Method and Data Process: “Monte Carlo Method” Presentation # 1 Nafisa Tasneem CHEP,KNU
Accelerating Statistical Static Timing Analysis Using Graphics Processing Units Kanupriya Gulati and Sunil P. Khatri Department of ECE, Texas A&M University,
Green’s Function Monte Carlo Method in a nutshell.
Monte Carlo Methods1 T Special Course In Information Science II Tomas Ukkonen
Chapter 4 Stochastic Modeling Prof. Lei He Electrical Engineering Department University of California, Los Angeles URL: eda.ee.ucla.edu
Quasi-Monte Carlo Methods Fall 2012 By Yaohang Li, Ph.D.
SIGGRAPH 2011 ASIA Preview Seminar Rendering: Accuracy and Efficiency Shinichi Yamashita Triaxis Co.,Ltd.
INFSO-RI Enabling Grids for E-sciencE SALUTE – Grid application for problems in quantum transport E. Atanassov, T. Gurov, A. Karaivanova,
FORS 8450 Advanced Forest Planning Lecture 5 Relatively Straightforward Stochastic Approach.
Groundwater Modeling, Inverse Characterization, and Parallel Computing Kumar Mahinthakumar NC State University.
P-Tree Implementation Anne Denton. So far: Logical Definition C.f. Dr. Perrizo’s slides Logical definition Defines node information Representation of.
Data Structures and Algorithms in Parallel Computing Lecture 2.
Motivation: Sorting is among the fundamental problems of computer science. Sorting of different datasets is present in most applications, ranging from.
Z Transform Primer. Basic Concepts Consider a sequence of values: {x k : k = 0,1,2,... } These may be samples of a function x(t), sampled at instants.
Parallel Algorithm for Multiple Genome Alignment Using Multiple Clusters Nova Ahmed, Yi Pan, Art Vandenberg Georgia State University SURA Cyberinfrastructure.
CS 484 Load Balancing. Goal: All processors working all the time Efficiency of 1 Distribute the load (work) to meet the goal Two types of load balancing.
Adaptive Mesh Applications Sathish Vadhiyar Sources: - Schloegel, Karypis, Kumar. Multilevel Diffusion Schemes for Repartitioning of Adaptive Meshes. JPDC.
Photo-realistic Rendering and Global Illumination in Computer Graphics Spring 2012 Hybrid Algorithms K. H. Ko School of Mechatronics Gwangju Institute.
CSCI-455/552 Introduction to High Performance Computing Lecture 23.
Structured ALE Solver. Overview Structured ALE mesh automatically generated Smaller input deck; Easier modifications to the mesh; Less I/O time. Shorter.
Monte Carlo Linear Algebra Techniques and Their Parallelization Ashok Srinivasan Computer Science Florida State University
Basic Communication Operations Carl Tropper Department of Computer Science.
An Iterative Monte Carlo Method for Nonconjugate Bayesian Analysis B. P. Carlin and A. E. Gelfand Statistics and Computing 1991 A Generic Approach to Posterior.
01/26/05© 2005 University of Wisconsin Last Time Raytracing and PBRT Structure Radiometric quantities.
Hierarchical Systolic Array Design for Full-Search Block Matching Motion Estimation Noam Gur Arie,August 2005.
Cognitive Information Service Basic Principles and Implementation of A Cognitive Inter-Node Protocol Optimization Scheme Dzmitry Kliazovich Fabrizio Granelli.
VIEWS b.ppt-1 Managing Intelligent Decision Support Networks in Biosurveillance PHIN 2008, Session G1, August 27, 2008 Mohammad Hashemian, MS, Zaruhi.
Monte Carlo Linear Algebra Techniques and Their Parallelization Ashok Srinivasan Computer Science Florida State University
Spatial Online Sampling and Aggregation
Lecture 4 - Monte Carlo improvements via variance reduction techniques: antithetic sampling Antithetic variates: for any one path obtained by a gaussian.
Parallel sorting.
Wellington Cabrera Advisor: Carlos Ordonez
Computational issues Issues Solutions Large time scale
Sampling Plans.
Presentation transcript:

Author: B. C. Bromley Presented by: Shuaiyuan Zhou Quasi-random Number Generators for Parallel Monte Carlo Algorithms

 Why & How  Sobol’ sequence & subsequences  Parallel Algorithm  Performance issue

The will and the way Enable parallel systems to take full advantage of the benefits of quasi-random Monte Carlo algorithms Rapid sequence production. as quick as any pseudorandom generator Fast convergence rate. much faster than pseudorandom methods An algorithm of generating quasi-random numbers in parallel systems, using a leapfrog scheme

Main contribution “A recursion relation which allows an element in a Sobol’ sequence to be quickly calculated from a previous, but not necessarily adjacent, element without determining all of the intervening members of the sequence”. Enables parallel Monte Carlo algorithms to have each node of a parallel processor step through interleaved subsequences with the same computational load as if it were calculating the original sequence, without any internode communication.

Sobol’ sequence

Recursive computation

Implementing in parallel In parallel applications, Breaking up the spatial domain of integration in the M- dimensional cube among processing nodes Distributing the sequence of sample points among processing nodes Leapfrog technique: each node skips along the sequence, jumping over those sample points which are handled by other nodes.

Modification to recursion

Algorithm

Algorithm – Cont.

Performance issue The efficiency of the leapfrog algorithm is highest when the number of nodes is an integral power of 2. Tests with several parallel supercomputers demonstrate that as many as 10 6 integration points (up to 6 dimensions) can be generated per second per node in the optimal case. When the number of nodes is P, the computational load of generating interleaved sequences is, at worst, proportional to the number of nonzero bits in P.

Thank you!