Confidential Computing on Heterogeneous Systems: Survey and Implications

#CUDA #FPGA #Security #Review #HeterogeneousSystems

https://hgpu.org/?p=29360

Confidential Computing on Heterogeneous Systems: Survey and Implications

In recent years, the widespread informatization and rapid data explosion have increased the demand for high-performance heterogeneous systems that integrate multiple computing cores such as CPUs, G…

hgpu.org

Helix: Distributed Serving of Large Language Models via Max-Flow on Heterogeneous GPUs

#HeterogeneousSystems #GPUcluster #LLM

https://hgpu.org/?p=29242

Helix: Distributed Serving of Large Language Models via Max-Flow on Heterogeneous GPUs

This paper introduces Helix, a distributed system for high-throughput, low-latency large language model (LLM) serving on heterogeneous GPU clusters. A key idea behind Helix is to formulate inferenc…

hgpu.org