Directly found the first issue im #FluidX3D: #OpenCL __attribute__((opencl_unroll_hint(x))) is not supported on AMD's old #GPU driver. Guaranteeing legacy hardware compatibility means testing on legacy hardware. I'm not giving software rot a chance! 🖖🧐
https://github.com/ProjectPhysX/FluidX3D/commit/b4481ae5bdc96fb062a6db3092af3470e50ccab0
Removed opencl_unroll_hint again (unsupported on some older GPUs), ad… · ProjectPhysX/FluidX3D@b4481ae

…ded more benchmarks in Readme

GitHub

#FluidX3D #CFD v3.7 brings faster Q-criterion isosurface rendering with #OpenCL local memory optimization! 🖖🤠
https://github.com/ProjectPhysX/FluidX3D/releases/tag/v3.7

Instead of 32 velocities for each #GPU thread, now an 8x8x8 workgroup loads & reuses 11x11x11 velocities in L1$, a 12x VRAM BW reduction.

Fascinating insight: Which thread loads which cell from VRAM to L1$, and which thread renders which grid cell within the workgroup, can be very different!
https://github.com/ProjectPhysX/FluidX3D/blob/master/src/kernel.cpp#L2827-L2956

PS: plugged X-wing Gif in #GitHub preview 🖖😜

Newest #IntelArc #GPU family member is here, the Panther Lake Arc B390... and it... purrs? 🖖 🥺 🐈‍⬛
My OpenCL-Benchmark on the B390 measures ~7.4 TFlops FP32 and ~120GB/s memory bandwidth. hw-smi also works with the B390.
#FluidX3D benchmarks here: https://github.com/ProjectPhysX/FluidX3D#single-gpucpu-benchmarks
And the #OpenCL infos:
- Arc B390: https://opencl.gpuinfo.org/displayreport.php?id=6718
- Core Ultra X7 358H: https://opencl.gpuinfo.org/displayreport.php?id=6717
#FluidX3D #CFD has reached ⭐ 5000 Stargazers on #GitHub! 🖖🥳
Grid refinement update is still in development, I haven't forgotten... ⬜◻️◽▫️
https://github.com/ProjectPhysX/FluidX3D

Finally Intel #GPU support on Linux too. Watch all the metrics go brrr in multi-GPU #FluidX3D #CFD workload! Will #opensource soon™️

Hardening against the myriads of broken counters in all those bugged APIs was a long shot. 🖖🫠

____________ | Windows | #Linux |
CPU / RAM | ✅️️WinAPI | ✅️️/proc |
#Nvidia GPU | ✅️️NVML | ✅️️NVML |
#Intel GPU | ✅IGCL | ✅SYSMAN |
#AMD GPU | ✅️️️️ADLX | ✅️️️️AMDSMI |

#FluidX3D #CFD v3.6 is out! This release accumulates a number of small improvements over the last months. Most notably, better interactive graphics support on #macOS with XQuartz. Have fun! 🖖😎🌊🍏
https://github.com/ProjectPhysX/FluidX3D/releases/tag/v3.6
Release FluidX3D v3.6 (improved macOS graphics) · ProjectPhysX/FluidX3D

FluidX3D v3.6 accumulates a number of small improvements over the last months. Most notably, better interactive graphics support on macOS. Improvements improved INTERACTIVE_GRAPHICS support on ma...

GitHub
Finally, the bars for #Intel hardware in the #FluidX3D #mermaid gantt performance chart are BLUE, as nature intended it.
https://github.com/ProjectPhysX/FluidX3D?tab=readme-ov-file#single-gpucpu-benchmarks
Some experimentation with ```mermaid ...``` charts in #GitHub #markdown. Turns out you can hack the formatting on the quadrantChart to turn it into an xy-scatter plot with individual point size/coloring/labeling.🖖🧐
First plot is datasheet memory bandwidth vs. FP32 TFlops/s, second plot is #FluidX3D performance vs. bandwidth, for lots of #GPU​/​#CPU hardware.
Here's me demoing #Intel Arc Pro B60 #GPU workstations at #SC25 in St. Louis, runnig SolidWorks and #FluidX3D! 🖖😎
https://www.youtube.com/watch?v=Z8yxiyXTi7I
Unleashing Workstations with Intel® Arc™ Pro B-Series Graphics | Intel Business

YouTube
Making progress on >top secret #FluidX3D update< but still a long way to go 🖖🧐