Alex Cheema (@alexocheema)
AMD Ryzen AI Max+ 시스템 클러스터에서 텐서 병렬화(tensor parallelism)를 성공적으로 운용한 사례를 묻는 질문형 트윗. 작성자는 소프트웨어 지원이 부족하다는 이야기를 들었다며, 왜 그런지와 실제 동작 사례를 궁금해하고 있음.
https://x.com/alexocheema/status/2031007365361770828
#tensorparallelism #amd #ryzenaimax #distributedtraining

Alex Cheema (@alexocheema) on X
Has anyone got tensor parallelism working with clusters of AMD Ryzen AI Max+ systems?
I heard the software support is lacking but curious why that is?
X (formerly Twitter)Akshay (@akshay_pachaar)
딥러닝 모델은 기본 설정으로는 여러 GPU가 있어도 보통 단일 GPU만 사용한다는 지적. 이상적인 학습은 학습 부하를 여러 GPU에 분산하는 것이라며, 다중 GPU 훈련을 위한 네 가지 전략을 그래픽으로 소개한다는 내용(멀티-GPU 분산 학습 기법 소개).
https://x.com/akshay_pachaar/status/2026649685243654194
#multigpu #distributedtraining #gpu #deeplearning

Akshay 🚀 (@akshay_pachaar) on X
By default, deep learning models only utilize a single GPU for training, even if multiple GPUs are available.
An ideal way to train models is to distribute the training workload across multiple GPUs.
The graphic depicts four strategies for multi-GPU training👇
X (formerly Twitter)Avi Chawla (@_avichawla)
Multi-GPU 트레이닝을 위한 4가지 전략을 시각 자료로 설명한 게시물입니다. 대규모 모델 학습에서의 병렬화·데이터/모델 분할·메모리 최적화 등 다양한 멀티-GPU 접근법을 한눈에 비교해 이해를 돕는 내용으로 보입니다.
https://x.com/_avichawla/status/2018935482382684460
#multigpu #distributedtraining #deeplearning #gpu

Avi Chawla (@_avichawla) on X
4 strategies for Multi-GPU training, explained visually:
X (formerly Twitter)Import AI 409: Huawei trains a model on 8,000+ Ascend chips; 32B decentralized training run; and the era of experience and superintelligence
https://importai.substack.com/p/import-ai-409-huawei-trains-a-model #AI #DistributedTraining
Import AI 409: Huawei trains a model on 8,000+ Ascend chips; 32B decentralized training run; and the era of experience and superintelligence
Welcome to Import AI, a newsletter about AI research.
Import AIImport AI 409: Huawei trains a model on 8,000+ Ascend chips; 32B decentralized training run; and the era of experience and superintelligence
https://importai.substack.com/p/import-ai-409-huawei-trains-a-model #AI #DistributedTraining
Import AI 409: Huawei trains a model on 8,000+ Ascend chips; 32B decentralized training run; and the era of experience and superintelligence
Welcome to Import AI, a newsletter about AI research.
Import AIImport AI 404: Scaling laws for distributed training; misalignment predictions made real; and Alibaba's good translation model
https://importai.substack.com/p/import-ai-404-scaling-laws-for-distributed #AI #DistributedTraining 
Import AI 404: Scaling laws for distributed training; misalignment predictions made real; and Alibaba's good translation model
How much could you get done if there were one million copies of you?
Import AIImport AI 404: Scaling laws for distributed training; misalignment predictions made real; and Alibaba's good translation model
https://importai.substack.com/p/import-ai-404-scaling-laws-for-distributed #AI #DistributedTraining
Import AI 404: Scaling laws for distributed training; misalignment predictions made real; and Alibaba's good translation model
How much could you get done if there were one million copies of you?
Import AI
Import AI 380: Distributed 1.3bn parameter LLM; math AI; and why reality is hard for Ai
What is our responsibility to machines that may become moral patients?
Import AI