Download presentation
Presentation is loading. Please wait.
Published byJody Hamilton Modified over 9 years ago
1
CcaEcloud Phase I Wrap-up Phase I Doe SBIR Stefan Muszala, PI DOE Grant No DE-FG02-08ER85152 Tech-X Corporation Boulder, CO Updates: onRamp, FACETS+Babel, Babel Structs
2
Tech-X Corporation Particle accelerator programs play a significant role in 14 out of 28 DOE laboratories which span a number of DOE offices such as the Offices of High Energy Physics (HEP), Nuclear Physics (NP) and Basic Energy Sciences (BES) (Facilities for the Future of Science) Accelerator simulation is required throughout the life- cycle of accelerators in four areas -Design -Analysis -Optimization -Upgrading Accelerator simulations play vital near-, medium-, and long-term roles
3
Tech-X Corporation Software reuse and common interfaces Ability to compose simulations Portability Mixed language programming interoperability Performance analysis of composed simulations High-performance accelerator software should allow complex applications while promoting good software engineering practices
4
Tech-X Corporation Channel Driver Component for isolating space charge kick calculation Tweaked SIDL interfaces over what was used for the electron cloud component Apply performance analysis Model 1 processor performance while increasing problem size Model 1-4 processor performance (two dual core AMD Opterons) Modifications to Bocca by Stephen Tramer Splicer block protection Bocca change Bocca copy Since the last CCA meeting finished Phase I work and prepared for Phase II
5
Tech-X Corporation The original Synergia2 channel driver exercised different space charge routines and provides a concise test-bed for a CCA implementation Results are comparable Horizontal Width (M) Longitudinal position (M)
6
Tech-X Corporation After Instrumentation we can see Solve and Kick behavior even with substantial call overhead
7
Tech-X Corporation The source of overhead is due to existing Synergia2 method structure
8
Tech-X Corporation Single Processor Performance of the space charge calculation goes as as N 3 Number of Particles = N 3, N=grid size, particles/cell=1 Core Work consists of two triple-nested for loops over 3-dimensions: 6(N 3 ) T 1 =T u 6(N 3 ) + T u = {min | max | average} for cell update. We use min. need to study why and min T u )
9
Tech-X Corporation Multi-Processor Performance starts with Amdahl’s law Start with Amdahl’s law T P = S + Q/P but let f= S/(S + Q) be the fraction of serial work. Amdahl’s law is now: T P = fT 1 + (1 − f )T 1 /P Account for Communication for PEs > 0 Substitute for T 1 and T comm T u = {min | max | average} for cell update. From serial model We use min. for now. Need to actually quantify f (cycle and instruction count), understand messaging better
10
Tech-X Corporation Multi-Processor Performance model matches real data for a 32 3 size problem (32,768 particles)
11
Tech-X Corporation Stephen Tramer was able to add features to Bocca Bocca splicer block protection when using bocca-merge in target files ---no-preserve option -4 regression tests -merging a protected block inside a key block, -ignoring a protected block outside a key block, -standard merge -preservation turned off Bocca change now supports: –--remove-implements –--remove-requires Bocca copy operation: –may now create exact duplicates of {component, class, interface, port, enum}
12
CcaEcloud Phase I Wrap-up Phase I Doe SBIR Stefan Muszala, PI DOE Grant No DE-FG02-08ER85152 Tech-X Corporation Boulder, CO Updates: onRamp, FACETS+Babel, Babel Structs
13
Tech-X Corporation Update on other projects using CCA tools OnRamp: –Need to write prototype with onRamp mini-tutorial and autotools –Possible use in Synergia2/TxPhysics as well as in FACETS transport model integration FACETS transport model integration: –Will work with Tom to implement an alternative struct passing mechanism Legacy Fortran allocates memory as the callee, reluctance to change contributed codes to fix this Concern over F2003 compilers available on Franklin and Jacquard in the near term Did not want to re-write existing f90 code for using the Array type Babel doesn’t support arrays of structs (and deeper nesting) so needed derived type refactor in transport models if using current Babel struct implementation Babel Structs: –Few hours so far. By end of May should be well on the way to implementing F77 & F90 struct support and tests. Java struct support and tests are next target
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.