Confidential Computing on Heterogeneous Systems: Survey and Implications
#CUDA #FPGA #Security #Review #HeterogeneousSystems
https://hgpu.org/?p=29360

Confidential Computing on Heterogeneous Systems: Survey and Implications
In recent years, the widespread informatization and rapid data explosion have increased the demand for high-performance heterogeneous systems that integrate multiple computing cores such as CPUs, G…
hgpu.orgHelix: Distributed Serving of Large Language Models via Max-Flow on Heterogeneous GPUs
#HeterogeneousSystems #GPUcluster #LLM
https://hgpu.org/?p=29242

Helix: Distributed Serving of Large Language Models via Max-Flow on Heterogeneous GPUs
This paper introduces Helix, a distributed system for high-throughput, low-latency large language model (LLM) serving on heterogeneous GPU clusters. A key idea behind Helix is to formulate inferenc…
hgpu.org