Can Large Language Models Predict Parallel Code Performance?

#CUDA #OpenMP #LLM #Performance #Benchmarking #Package

https://hgpu.org/?p=29903

Can Large Language Models Predict Parallel Code Performance?

Accurate determination of the performance of parallel GPU code typically requires execution-time profiling on target hardware β€” an increasingly prohibitive step due to limited access to high-…

hgpu.org

πŸ“’ Great news! The IWOMP 2025 Call for Papers deadline is extended to May 23, 2025 (AoE).

Join us in Charlotte, NC, USA (Oct 1–3) for the premier OpenMP event, co-located with #EuroMPI2025.

Topics include offloading, applications, tasking, tools & more.

Papers will be published in LNCS.

Submit here: https://www.iwomp.org/call-for-papers/

#IWOMP2025 #OpenMP #HPC #ParallelProgramming #CFP

IWOMP 2025 - The International Workshop on OpenMP

The premier forum to present and discuss issues, trends, recent research, and results related to parallel programming with OpenMP.

IWOMP

It is really wild how fast one implement stuff on the #GPU using just #openmp
Talking about zero to hero in a couple of weeks. There are some optimizations that involve the #simd directive that is not entirely clear to me when to use them, while #gcc also uses some very non transparent rules to map code to vector #ptx instructions but the gain is worth the pain of not understanding these two (and a few other things)

@openmp_arb

The #AIs (Claude 3.7 Sonnet Thinking in this case) should be paying me for the humblepie.
Ingredients of the pie: #OpenMP acceleration of an non-LLM algo in the #GPU. The little turd (molested by #Python noise) even suggested that indentation of a C program was the problem.
The #AIs (Claude 3.7 Sonnet Thinking in this case) should be paying me for the humblepie.
Ingredients of the pie: #OpenMP acceleration of an non-LLM algo in the #GPU. The little turd (molested by #Python noise) even suggested that indentation of a C program was the problem.
The #AIs (Claude 3.7 Sonnet Thinking in this case) should be paying me for the humblepie. Ingredients of the pie: #OpenMP acceleration of an non-LLM algo in the #GPU. The little turd (molested by #Python noise) even suggested that indentation of a C program was the problem.
Bluesky

Bluesky Social
The #AIs (Claude 3.7 Sonnet Thinking in this case) should be paying me for the humblepie.
Ingredients of the pie: #OpenMP acceleration of an non-LLM algo in the #GPU. The little turd (molested by #Python noise) even suggested that indentation of a C program was the problem.

πŸ†• The latest OpenMP Newsletter is out! It features:

πŸ“’ IWOMP 2025 Call for Papers – Share your research at the premier OpenMP workshop!
🧭 Webinar Series – An exploration of the OpenMP 6.0 roadmap and new features.
πŸ“š New OpenMP Course – Learn the fundamentals.
πŸ› οΈ Intel Fortran Compiler Update – Now with expanded OpenMP 6.0 support!
πŸ”₯ FLOW-3D Release – Welding simulation powered by OpenMP.

Check it out! πŸ‘‰ https://mailchi.mp/5127171ec46d/thanks-for-your-interest-in-openmp-17459423

#OpenMP #HPC #ParallelProgramming #IWOMP #FLOW3D #IntelFortran

Brought together our two #TitanComputers that used to support two different projects to set up a small #MPI / #OpenMP cluster (96 physical cores, 1TB DRAM, 40TB storage). We will probably drop a couple of GPUs inside them later on, but for now, they will support our EMR data analytics on the #CPU
Brought together our two #TitanComputers that used to support two different projects to set up a small #MPI / #OpenMP cluster (96 physical cores, 1TB DRAM, 40TB storage). We will probably drop a couple of GPUs inside them later on, but for now, they will support our EMR data analytics on the #CPU