Mastodawn

Can Large Language Models Predict Parallel Code Performance?

#CUDA #OpenMP #LLM #Performance #Benchmarking #Package

Accurate determination of the performance of parallel GPU code typically requires execution-time profiling on target hardware — an increasingly prohibitive step due to limited access to high-…

hgpu.org

OpenMP ARB May 12

📢 Great news! The IWOMP 2025 Call for Papers deadline is extended to May 23, 2025 (AoE).

Join us in Charlotte, NC, USA (Oct 1–3) for the premier OpenMP event, co-located with #EuroMPI2025.

Topics include offloading, applications, tasking, tools & more.

Papers will be published in LNCS.

Submit here: https://www.iwomp.org/call-for-papers/

#IWOMP2025 #OpenMP #HPC #ParallelProgramming #CFP

IWOMP 2025 - The International Workshop on OpenMP

The premier forum to present and discuss issues, trends, recent research, and results related to parallel programming with OpenMP.

IWOMP

Christos Argyropoulos May 4

It is really wild how fast one can implement stuff on the #GPU using just #openmp
Talking about zero to hero in a couple of weeks. There are some optimizations involving the #simd directive where it is not entirely clear to me when to use them, and #gcc also uses some very non-transparent rules to map code to vector #ptx instructions, but the gain is worth the pain of not understanding these two (and a few other things).

@openmp_arb
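
For readers unfamiliar with the kind of code the post above refers to, here is a minimal sketch of an OpenMP GPU offload loop that includes the `simd` clause. It is not the author's code: the function name `saxpy_offload` and the SAXPY kernel are hypothetical choices made for illustration. Whether `simd` helps on a given device depends on the compiler and target, which is the uncertainty the post describes.

```c
#include <stddef.h>

/* Hypothetical example: y[i] = a * x[i] + y[i].
 * With an offloading-capable compiler the loop runs on the device;
 * if no device is available, or OpenMP is disabled, it runs on the host. */
void saxpy_offload(size_t n, float a, const float *x, float *y)
{
    /* map(to:) copies x to the device; map(tofrom:) copies y in and back out.
     * The combined construct spreads iterations over teams and threads,
     * and the trailing simd asks for vectorization within each thread. */
    #pragma omp target teams distribute parallel for simd \
        map(to: x[0:n]) map(tofrom: y[0:n])
    for (size_t i = 0; i < n; ++i) {
        y[i] = a * x[i] + y[i];
    }
}
```

With GCC this would typically be built with `-fopenmp`, plus an offload target such as `-foffload=nvptx-none` if the toolchain was configured for it (an assumption about the local setup). Without `-fopenmp` the pragma is ignored and the loop runs sequentially with the same result.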

Christos Argyropoulos MD, PhD Apr 27

The #AIs (Claude 3.7 Sonnet Thinking in this case) should be paying me for the humble pie.
Ingredients of the pie: #OpenMP acceleration of a non-LLM algo on the #GPU. The little turd (molested by #Python noise) even suggested that indentation of a C program was the problem.

OpenMP ARB Apr 25

🆕 The latest OpenMP Newsletter is out! It features:

📢 IWOMP 2025 Call for Papers – Share your research at the premier OpenMP workshop!
🧭 Webinar Series – An exploration of the OpenMP 6.0 roadmap and new features.
📚 New OpenMP Course – Learn the fundamentals.
🛠️ Intel Fortran Compiler Update – Now with expanded OpenMP 6.0 support!
🔥 FLOW-3D Release – Welding simulation powered by OpenMP.

Check it out! 👉 https://mailchi.mp/5127171ec46d/thanks-for-your-interest-in-openmp-17459423

#OpenMP #HPC #ParallelProgramming #IWOMP #FLOW3D #IntelFortran

Christos Argyropoulos MD, PhD Apr 21

Brought together our two #TitanComputers that used to support two different projects to set up a small #MPI / #OpenMP cluster (96 physical cores, 1TB DRAM, 40TB storage). We will probably drop a couple of GPUs inside them later on, but for now, they will support our EMR data analytics on the #CPU
