Funded by US-DOE grants DE-FG02-05ER84172,

Slides:



Advertisements
Similar presentations
Steady-state heat conduction on triangulated planar domain May, 2002
Advertisements

4 th order Embedded Boundary FDTD algorithm for Maxwell Equations Lingling Wu, Stony Brook University Roman Samulyak, BNL Tianshi Lu, BNL Application collaborators:
Plasma Medicine in Vorpal Tech-X Workshop / ICOPS 2012, Edinburgh, UK 8-12 July, 2012 Alexandre Likhanskii Tech-X Corporation.
Parallel Jacobi Algorithm Steven Dong Applied Mathematics.
Maxwell’s Equations from Special Relativity James G. O’Brien Physics Club Presentation University Of Connecticut Thursday October 30th, 2008.
1 Spring 2003 Prof. Tim Warburton MA557/MA578/CS557 Lecture 29.
Algorithm Development for the Full Two-Fluid Plasma System
1 Electromagnetism We want to apply the reaction theory developed in the first few lectures to electronuclear interactions. It is worthwhile reviewing.
5/4/2015rew Accuracy increase in FDTD using two sets of staggered grids E. Shcherbakov May 9, 2006.
University of Colorado at Boulder Center for Integrated Plasma Studies Two-Fluid Applications of NIMROD D. C. Barnes University.
Ray theory and scattering theory Ray concept is simple: energy travels between sources and receivers only along a “pencil-thin” path (perpendicular to.
EMLAB 1 Numerical Boundary Conditions. EMLAB 2 A Problem at the Boundary of the Grid We must implement the update equations for every point in the grid.
Discrete Exterior Calculus. More Complete Introduction See Chapter 7 “Discrete Differential Forms for Computational Modeling” in the SIGGRAPH 2006 Discrete.
1.2 Row Reduction and Echelon Forms
Linear Equations in Linear Algebra
Coupled Fluid-Structural Solver CFD incompressible flow solver has been coupled with a FEA code to analyze dynamic fluid-structure coupling phenomena CFD.
Can We Determine Electric Fields and Poynting Fluxes from Vector Magnetograms and Doppler Shifts? by George Fisher, Brian Welsch, and Bill Abbett Space.
EM Computations with Embedded Boundaries (cut cells) C. Nieter †, J.R. Cary †* (presenter), G.R. Werner *, D.N. Smithe †, P.H. Stoltz †, S. Ovtchinnikov.
Estimating Electric Fields from Sequences of Vector Magnetograms George H. Fisher, Brian T. Welsch, William P. Abbett, and David J. Bercik University of.
Estimating Electric Fields from Vector Magnetogram Sequences G. H. Fisher, B. T. Welsch, W. P. Abbett, D. J. Bercik University of California, Berkeley.
An Introduction to Breakdown Simulations With PIC Codes C. Nieter, S.A. Veitzer, S. Mahalingam, P. Stoltz Tech-X Corporation MTA RF Workshop 2008 Particle-in-Cell.
© 2003 by Davi GeigerComputer Vision October 2003 L1.1 Structure-from-EgoMotion (based on notes from David Jacobs, CS-Maryland) Determining the 3-D structure.
Chapter 1 Vector analysis
Parabolic PDEs Generally involve change of quantity in space and time Equivalent to our previous example - heat conduction.
Computational Fluid Dynamics I PIW Numerical Methods for Parabolic Equations Instructor: Hong G. Im University of Michigan Fall 2005.
Finite Difference Time Domain Method (FDTD)
PHY 042: Electricity and Magnetism
Plasma Dynamics Group Aerospace & Energetics Research Program Boundary Conditions for MHD Resistive MHD equations in weakly conservative form (balance.
Physics of Fusion power Lecture4 : Quasi-neutrality Force on the plasma.
Applications of Frequency Extraction to Cavity Modeling Travis M. Austin* and John R. Cary*,¶ Tech-X Corporation*, University of Colorado-Boulder ¶ Collaborator:
Implementation of 2D FDTD
Finite Differences Finite Difference Approximations  Simple geophysical partial differential equations  Finite differences - definitions  Finite-difference.
Numerical Computations in Linear Algebra. Mathematically posed problems that are to be solved, or whose solution is to be confirmed on a digital computer.
Venkatram Ramanathan 1. Motivation Evolution of Multi-Core Machines and the challenges Summary of Contributions Background: MapReduce and FREERIDE Wavelet.
Fall 2014 Notes 23 ECE 2317 Applied Electricity and Magnetism Prof. David R. Jackson ECE Dept. 1.
Most physically significant large-scale atmospheric circulations have time scales on the order of Rossby waves but much larger than the time scales.
Challenging problems in kinetic simulation of turbulence and transport in tokamaks Yang Chen Center for Integrated Plasma Studies University of Colorado.
ΑΡΙΘΜΗΤΙΚΕΣ ΜΕΘΟΔΟΙ ΜΟΝΤΕΛΟΠΟΙΗΣΗΣ 4. Αριθμητική Επίλυση Συστημάτων Γραμμικών Εξισώσεων Gaussian elimination Gauss - Jordan 1.
Antenna Modeling Using FDTD SURE Program 2004 Clemson University Michael Frye Faculty Advisor: Dr. Anthony Martin.
Sept COMP60611 Fundamentals of Concurrency Lab Exercise 2 Notes Notes on the finite difference performance model example – for the lab… Graham Riley,
VORPAL Optimizations for Petascale Systems Paul Mullowney, Peter Messmer, Ben Cowan, Keegan Amyx, Stefan Muszala Tech-X Corporation Boyana Norris Argonne.
Thermal-ADI: a Linear-Time Chip-Level Dynamic Thermal Simulation Algorithm Based on Alternating-Direction-Implicit(ADI) Method Good afternoon! The topic.
Scalable Symbolic Model Order Reduction Yiyu Shi*, Lei He* and C. J. Richard Shi + *Electrical Engineering Department, UCLA + Electrical Engineering Department,
7. Introduction to the numerical integration of PDE. As an example, we consider the following PDE with one variable; Finite difference method is one of.
Stable, Circulation- Preserving, Simplicial Fluids Sharif Elcott, Yiying Tong, Eva Kanso, Peter Schröder, and Mathieu Desbrun.
OVERFLOW Using OVERFLOW version 2.0y Shared, distributed and hybrid parallel processing schemes available with MPI and OpenMP Wide range of applications:
Jungpyo Lee Plasma Science & Fusion Center(PSFC), MIT Parallelization for a Block-Tridiagonal System with MPI 2009 Spring Term Project.
1 EEE 431 Computational Methods in Electrodynamics Lecture 8 By Dr. Rasime Uyguroglu
Modeling Electromagnetic Fields in Strongly Inhomogeneous Media
ECE 8443 – Pattern Recognition ECE 8423 – Adaptive Signal Processing Objectives: Normal Equations The Orthogonality Principle Solution of the Normal Equations.
TITLE Stuart R.Hudson, D.A.Monticello, A.H.Reiman, D.J.Strickler, S.P.Hirshman, L-P. Ku, E.Lazarus, A.Brooks, M.C.Zarnstorff, A.H.Boozer, G-Y. Fu and G.H.Neilson.
1 Chapter 7 Numerical Methods for the Solution of Systems of Equations.
1 EEE 431 Computational Methods in Electrodynamics Lecture 7 By Dr. Rasime Uyguroglu
Formulation of 2D‐FDTD without a PML.
ECE 576 – Power System Dynamics and Stability Prof. Tom Overbye Dept. of Electrical and Computer Engineering University of Illinois at Urbana-Champaign.
1 Spring 2003 Prof. Tim Warburton MA557/MA578/CS557 Lecture 28.
1 1.2 Linear Equations in Linear Algebra Row Reduction and Echelon Forms © 2016 Pearson Education, Ltd.
8. Wave Guides and Cavities 8A. Wave Guides Suppose we have a region bounded by a conductor We want to consider oscillating fields in the non-conducting.
Copyright ©2008, Thomson Engineering, a division of Thomson Learning Ltd.
PIVOTING The pivot or pivot element is the element of a matrix, or an array, which is selected first by an algorithm (e.g. Gaussian elimination, simplex.
Chapter 30.
THE METHOD OF LINES ANALYSIS OF ASYMMETRIC OPTICAL WAVEGUIDES Ary Syahriar.
Prof: Ming Wu GSI: Kevin Han Discussion 1/17/18
L Ge, L Lee, A. Candel, C Ng, K Ko, SLAC
Linear Equations in Linear Algebra
Wavelet “Block-Processing” for Reduced Memory Transfers
Faraday’s law Accelerated electron creates time-varying current, time-varying current creates time-varying magnetic field, time-varying magnetic field.
Introduction: A review on static electric and magnetic fields
Comparison of CFEM and DG methods
Linear Equations in Linear Algebra
Presentation transcript:

Funded by US-DOE grants DE-FG02-05ER84172, Dey-Mittra ADI: An absolutely stable method for Maxwell’s Equations with Embedded Boundaries Travis Austin John Cary, David Smithe Tech-X Corporation Funded by US-DOE grants DE-FG02-05ER84172, DE-FC02-07ER41499, and FA9451-06-D-0115/002. Serguei Ovtchinnikov and Johan Carlsson have also done some work on ADI in addition to Chet Nieter and Peter Stoltz who have tested out the method in VORPAL. ComPASS Meeting Boulder, CO Tuesday, October 6, 2009

Overview Background Motivation Dey-Mittra Cut-Cell Stability Alternating-Direction Implicit (ADI) Methods Divergence-preserving Tridiagonal Solves (Smithe, Cary, Carlsson, Ovtchinnikov) Dey-Mittra ADI Method Implementation Frequency Extraction Results Performance Future Work Present overview relatively quickly.

Background Maxwell’s Equations: Yee Method: Bz Ey Ez Bx By Ex Remember: Click brings up second equation for Maxwell’s. Explain that P contains positive derivative terms and M contains negative derivative terms. Ex Ez Ey Bx By Bz 3

Background Yee Method (Faraday’s Law): Courant-Friedrichs-Lewy Stability Condition: Faraday’s Law for the By component where Axz is area of face and lx and lz are edge lengths. 4

Motivation Dey-Mittra Cut-Cell Stability Start by asking rhetorical question: Why consider ADI for Maxwell’s Equations? An embedded boundary can create cells with only a fraction of their volume inside of the computational domain. (Point to four corner cells.) These cells with fractional volumes lead, via stability considerations, a significantly reduced time step to maintain stability. Stairstep does not lead to these restrictions but method is only first-order as it inaccurately approximates the boundary. (a) Dey-Mittra Approach (b) Stairstep Approach - Only change Faraday update 5

Motivation Dey-Mittra Cut-Cell Stability Dey-Mittra-Induced Stability Condition: Determination of fDM: fractional value between 0 and 1 based on stability derived from Gershgorin circle theorem too small cut-cells make time step prohibitively small an implicit method would overcome these restrictions A fractional factor, fDM, is defined which, via Gerchgorin results, leads to a reduction in the time step and the excluding of “too-small” cells to maintain stability. We need an implicit method to overcome these stability considerations. 6

Time Step Based on Accuracy Motivation Time Step Based on Accuracy Smithe’s Rule: 6 temporal points/period => good picture 8 temporal points/period => beat theoretician 12 temporal points/period => build something Explicit FDTD requires us to overresolve the temporal aspect of the simulation because of stability considerations. Desire an implicit approach that is not too costly and permits the temporal resolution to be governed by accuracy and not stability. Explicit FDTD at high spatial resolution requires overresolution of temporal space because of CFLcondition. Again, we need an implicit method which is limited by accuracy and not stability. 7

Alternating-Direction Implicit Methods Divergence-Preserving Form There are four 2nd-order accurate variants of the ADI algorithm, depending on the order of the operands: ZCZ is the first. We investigated the last … Mention work of Smithe, Carlsson, and Cary and funded by a DOE SBIR Phase I in 2007.

Alternating-Direction Implicit Methods Divergence-Preserving Form The operator P+M is the curl operator, so for Yee-cell Of the four ADI combinations, only the last form, DP, can be algebraically manipulated to show that its final operation is equivalent to a finite-difference curl Thus it is divergence preserving, for source S.

Alternating-Direction Implicit Methods Divergence-Preserving Form Full details are in the paper D. N. Smithe, J. R. Cary, J. A. Carlsson, ”Divergence preservation in the ADI algorithms for electromagnetics,” J. of Comp. Physics 228, 7289 (2009).

Alternating-Direction Implicit Methods Tridiagonal Solves Each domain does forward-solve over its domain Passes boundary data to single process global solve Receives data, then back-solves over its domain   Colored Blocks correspond to (EU^-1)*(L^-1D) which is the serial bottleneck matrix. 

Alternating-Direction Implicit Methods Tridiagonal Solves Remedy is Concurrent Divide & Conquer In 2-D and 3-D there are multiple 1-D solves. Global solves are distributed across the processes. Reduced global solve is now a source of idle processor time. 

Alternating-Direction Implicit Methods Tridiagonal Solves For good scaling, the local backward solve must cover latency. N  # cells in process Longer 1-D dimension, N1/2 rather than N1/3, means more time to cover latency. 2-D scales easier than 3-D for multiprocessor distributed computing

Alternating-Direction Implicit Methods Tridiagonal Solves There was good scaling as long as Ncells(1/Ndim)  64.  Implies typically good scaling for ADI in both 2-D and 3-D. On office linux cluster, needed Ncells(1/Ndim)  128 for good scaling  Implies good scaling for 2-D, and marginal scaling for 3-D. (Ideal = dotted line) Scaling study on BG/L was favorable

Dey-Mittra ADI Implementation Accelerator devices have boundaries which are nonconvex. This breaks each row tri-diagonal solve into several solves. Simply unit-fill the diagonals of rows for fully exterior field components, and set RHS source to zero.

Dey-Mittra ADI Implementation Dey-Mittra is a metallic cut-cell algorithm giving 2nd-order accurate global solutions. Modify Faraday’s law to use non-metallic electric line length and magnetic flux area. What is the Dey-Mittra Cut-Cell Algorithm? dAnon-metallic dlnon-metallic

Division by small area limits algorithm. Dey-Mittra ADI Implementation Division by small area limits algorithm.  Reduction in time step is significant, 0.5Dt for decent results, and 0.25Dt or even 0.10Dt for excellent results.  Throws away small cells, leading to occasional “pits” and “scratches” in geometry including particle creation/destruction surfaces. Using ADI can eliminate both these inconveniences. Dey-Mittra Cut-Cell Algorithm has two disadvantages

Dey-Mittra ADI Implementation Technically, the stability of ADI requires that the two alternating-direction curl matrix operators, P and M, be anti-symmetric. The Dey-Mittra length/area factors appear to destroy this anti-symmetry. However, anti-symmetry can be recovered by solving for re-scaled fields. Then removing scale factors. Bottom-line: OK to use Dey-Mittra difference matrix even in non-anti-symmetric matrices. E.g., it’s still stable. (Even when matrix element  )

Frequency Extraction Results Dey-Mittra ADI Frequency Extraction Results Testing modes of A6 magnetron. Frequency (Hz) ADI with Dey-Mittra is validated in VORPAL 1/Nx2

Frequency Extraction Results Dey-Mittra ADI Frequency Extraction Results 2nd-order accuracy verified, even for dt that is 8 times the normal Courant limit. ADI with Dey-Mittra is validated in VORPAL

Frequency Extraction Results Dey-Mittra ADI Frequency Extraction Results Method investigated in 3D for A15 cavity Magnetic Field (z-component) tf = 343.06 ps Time Steps = 2000 ADI with Dey-Mittra is validated in VORPAL Electric Field (x-component) Time Step = 2.0 DtCFL fDM = 0.0000001 => all cells kept in the simulation

Dey-Mittra ADI Performance: 2D Simulation Parameters: Single CPU Results 2D A6 Magnetron Benchmark 2500 (50x50) cells GMRES w/ Jacobi preconditioner 1.358 ns of simulation time Explicit: 1280 time steps ADI-1.0: 320 time steps ADI-2.0: 160 time steps ADI-4.0: 80 time steps ADI-8.0: 40 time steps ADI-16.0: 20 time steps ADI-32.0: 10 time steps ADI-64.0: 5 time steps MaxIts raised at 8x CFL from 100 to 500 and then to 1000 at 32x CFL and then to 5000 at 64x CFL ADI with Dey-Mittra is validated in VORPAL

Performance: Strong Scaling Dey-Mittra ADI Performance: Strong Scaling ADI with Dey-Mittra is validated in VORPAL

Full details are in the paper Dey-Mittra ADI Full details are in the paper T. M. Austin, J. R. Cary, D. N. Smithe, C. Nieter, ”Alternating Direction Implicit Methods for FDTD using the Dey-Mittra Embedded Boundary Method,” accepted in The Open Plasma Physics Journal.

Summary Motivation for ADI methods was discussed in the context of Dey-Mittra method. Efficient parallel tridiagonal solves were presented and performance was verified. Implementation of ADI for the Dey-Mittra method was introduced and results showed stability beyond CFL. Argument made for fast tridiagonal solves. Multiprocessor? GPUs?