Andreas Herten

@andih
My web is my castle • High-Performance Computing with GPUs at Jülich Supercomputing Centre
Lab Blog: https://x-dev.pages.jsc.fz-juelich.de/
Personal Website: https://www.andreasherten.de
Finally, #JUPITER crossed the 1 ExaFLOP/s threshold today. The list is lying to you, though: it's not exactly 1000 PFLOP/s, it's 1000.184 PFLOP/s; the rest got lost to rounding.
Those 184 TFLOP/s are pretty much exactly the performance of the earlier #JSC supercomputer JUGENE from 2008. Incredible that this gets lost in the margins only 17 years later.
https://mastodon.social/deck/@fzj_jsc@social.fz-juelich.de/115566616685186256
#HPC #exa_JUPITER
I ended up presenting five distinct slide decks at #ISC25. If I find time, what should I blog about?
Here are the title slides. Luckily, I needed to use PowerPoint only once 😊.
JUPITER is in the TOP500! We didn’t manage to break the exascale barrier just yet, but did pretty well in the timeframe! Happy to chat about #JUPITER at the dedicated booth later! #HPC #Exa_JUPITER
Our #SC24 tutorial is in full swing. Onboarding 60 people rapidly to JEDI (the #JUPITER preparation system) is quite a challenge (and exciting)! #HPC
Presented at the “Arm in HPC” BoF today. Great turnout! Everyone was pretty happy with their Grace. #HPC #ISC24
Last night’s aurora was crazy! It was only barely visible to the naked eye, but the phone’s 3 s exposure showed it well. #polarlichter #aurora
Our work on optimizing a Lagrangian transport simulation (#MPTRAC) for #GPUs is online as a preprint: https://doi.org/10.5194/egusphere-2023-2547
We tried something new: summarizing the optimizations in a Sankey diagram. What do you think? I like it! #HPC
Accelerating Lagrangian transport simulations on graphics processing units: performance optimizations of MPTRAC v2.6

Abstract. Lagrangian particle dispersion models are indispensable tools for the study of atmospheric transport processes. However, Lagrangian transport simulations can become numerically expensive when large numbers of air parcels are involved. To accelerate these simulations, we made considerable efforts to port the Massive-Parallel Trajectory Calculations (MPTRAC) model to graphics processing units (GPUs). Here we discuss performance optimizations of the major bottleneck of the GPU code of MPTRAC, the advection kernel. Timeline, roofline, and memory analyses of the baseline GPU code revealed that the application is memory-bound and performance suffers from near-random memory access patterns. By changing the data structure of the horizontal wind and vertical velocity fields of the global meteorological data driving the simulations from Structure of Arrays (SoA) to Array of Structures (AoS), and by introducing a sorting method for better memory alignment of the particle data, performance was greatly improved. We evaluated the performance on NVIDIA A100 GPUs of the Jülich Wizard for European Leadership Science (JUWELS) Booster module at the Jülich Supercomputing Center, Germany. For our largest test case, transport simulations with 10^8 particles driven by the European Centre for Medium-Range Weather Forecasts (ECMWF) ERA5 reanalysis, we found that the runtime for the full set of physics computations was reduced by 75 %, including a reduction of 85 % for the advection kernel. In addition to demonstrating the benefits of code optimization for GPUs, we show that the runtime of CPU-only simulations is also improved. For our largest test case, we found a runtime reduction of 34 % for the physics computations, including a reduction of 65 % for the advection kernel.
The code optimizations discussed here bring the MPTRAC model closer to applications on upcoming exascale high performance computing systems, and will also be of interest for optimizing the performance of other models using particle methods.
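
To make the two ideas in the abstract a bit more concrete, here is a minimal sketch in Python of why an AoS layout keeps the wind components of one grid cell close together in memory, and of what sorting particles by grid cell looks like. It is not MPTRAC code: the function names, the three-field layout (u, v, w), and the grid and particle sizes are all assumptions made for illustration.

```python
# Illustrative sketch only: all names and sizes are hypothetical, not MPTRAC code.

N_FIELDS = 3  # assumed fields per grid cell: u, v (horizontal wind), w (vertical velocity)


def soa_offset(field: int, cell: int, n_cells: int) -> int:
    """Flat offset when each field is stored as its own contiguous array (SoA)."""
    return field * n_cells + cell


def aos_offset(field: int, cell: int) -> int:
    """Flat offset when the fields of one cell are stored side by side (AoS)."""
    return cell * N_FIELDS + field


def span(offsets):
    """Distance between the lowest and highest offset touched."""
    return max(offsets) - min(offsets)


def sort_by_cell(particle_cells):
    """Return particle indices ordered by the grid cell each particle sits in."""
    return sorted(range(len(particle_cells)), key=lambda p: particle_cells[p])


# Reading u, v, w for a single grid cell under both layouts.
n_cells = 1_000_000
cell = 123_456
soa_span = span([soa_offset(f, cell, n_cells) for f in range(N_FIELDS)])
aos_span = span([aos_offset(f, cell) for f in range(N_FIELDS)])

# Particles that share a cell end up adjacent after sorting.
particle_cells = [42, 7, 42, 3, 7]
order = sort_by_cell(particle_cells)

print(soa_span, aos_span, order)
```

In this toy setup the three values for one cell are spread over two million elements in SoA but are adjacent in AoS, and sorting places particles that read the same cell next to each other. Whether that helps in practice depends on the access pattern; the abstract reports that it did for the advection kernel.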

Our tutorial at #SC23 is in full swing! #HPC
We have this tradition in the west of Germany of filling all trains going to the airport to maximum occupancy with dressed-up people on 11 Nov. #roadtoSC #HPC
ECMWF Reading has really cool rooms. So official! #HPC #MAELSTROM