Alex Cheema (@alexocheema)

RDMA 불필요 주장: prefill/Decode 분해(분산·디스어그리게이션)는 대기시간(latency)에 민감하지 않아 고가의 RDMA 대신 10GbE로 충분하다는 기술적 분석과 결과를 공유했다는 내용. 네트워크 아키텍처 선택과 비용·성능 트레이드오프에 대한 인프라 논의임.

https://x.com/alexocheema/status/2027830902487707843

#rdma #10gbe #disaggregation #inference

Alex Cheema (@alexocheema) on X

@lmc_security @exolabs Not RDMA. Prefill/Decode disaggregation is not latency sensitive so RDMA is not necessary. It's 10GbE. We wrote up the analysis / results here: https://t.co/KGOfjRIJ9c

X (formerly Twitter)

Anyscale công bố: >50% cụm AI chỉ sử dụng <50% GPU do tải công việc gián đoạn. Giải pháp Ray: tách CPU và GPU (disaggregation) để tối ưu tài nguyên. Tuy nhiên, một số cho rằng đây là quá kỹ thuật; nếu mô hình 70B tải <2 s (ephemeral) thì GPU có thể tắt hoàn toàn, giảm chi phí. Bạn ưu tiên tối đa hoá sử dụng hay tải nhanh? #AI #GPU #Disaggregation #Ray #Ephemeral #TríTuệNhânTạo

https://www.reddit.com/r/LocalLLaMA/comments/1qjbufk/anyscales_new_data_most_ai_clusters_run_at_50/

AI models keep getting faster, but your infrastructure isn’t keeping up.

As LLMs power everything from customer support to enterprise search, monolithic server setups are turning into major bottlenecks.

Could #Disaggregation be the answer?

📰 Dive into the #InfoQ article to learn more: https://bit.ly/48987Em

#AI #LLMs #Infrastructure #Frameworks

ICYMI: Wenn ihr wissen wollt, was FrauStief_in_IT diese Woche so alles auf der #ISC23 in Hamburg gesehen hat, hört doch mal in unseren data://express #Podcast rein.

In der aktuellen Episode geht es um #quantencomputer #liquidcooling und #disaggregation

https://data-express.letscast.fm/episode/dxprs0039-it-on-steroids-auf-der-isc-hpc

DXPRS0039: IT on Steroids auf der ISC HPC

FrauStief_in_IT hat die ISC HPC in Hamburg besucht und leuchtende Augen bekommen ob all der Quanten, Quantenrechner, Flüssigkühlern und Network Accelerators.

data://express
Unsere Chefredaktion Kerstin weilt dieser Tage in Hamburg auf der ISC High Performance, das was man gemeinhin als Supercomputer versteht. Sie hat sich unter anderem umgesehen und -gehört bei Herstellern von Quantum-Computern, bei schnellen Netzwerkkomponenten und sich zu Disaggregation per CXL schlau gemacht. Ihre Eindrücke könnt ihr in der neuesten Episode data://express nachhören: https://data-express.letscast.fm/episode/dxprs0039-it-on-steroids-auf-der-isc-hpc #quantencomputing #liquidcooling #disaggregation #cxl
DXPRS0039: IT on Steroids auf der ISC HPC

FrauStief_in_IT hat die ISC HPC in Hamburg besucht und leuchtende Augen bekommen ob all der Quanten, Quantenrechner, Flüssigkühlern und Network Accelerators.

data://express
@5SpeedFun I haven’t tried it myself, but I hear that current OpenWRT builds have VRF support and, coincidentally, a number of EdgeRouters are on their HCL. #Disaggregation
#Disaggregation was a safety necessity of the #pandemic- water, food, #PPE, masks and #vaccines became life and death issue without local control. However the trend is in the other direction. Albertson and Safeway's merger closed 100 stores and laid of 8000. The Kroeger Albertson consolidation is promising jobs and more stores, to avoid restrictions, and the bipartisan regulators believe them! https://www.levernews.com/email/dfa8f40d-43bd-4605-a63f-31c7290d481c/?ref=podcasts-newsletter
The Corporate Merger No One’s Talking About

On this week’s Lever podcasts: David explores the ramifications of the potential Kroger/Albertsons merger; and The Audit study group completes Axelrod and Rove’s campaign strategy MasterClass.

The Lever