Red Hat and Tesla engineers tackled a real production problem together.
3x output tokens/sec, 2x faster TTFT on Llama 3.1 70B with KServe + llm-d + vLLM. Fixes pushed upstream to KServe along the way.
This is what open source looks like. 🤝 🚀
https://llm-d.ai/blog/production-grade-llm-inference-at-scale-kserve-llm-d-vllm
#RedHat #Tesla #RedHatAI #vLLM #Pytorch #Kubernetes #OpenShift #KServe #llmd #Llama #OpenSource

Production-Grade LLM Inference at Scale with KServe, llm-d, and vLLM | llm-d
How migrating from a simple vLLM deployment to a robust MLOps platform utilizing KServe, llm-d's intelligent routing, and vLLM solved significant scaling and operational challenges in LLM deployment through deep customization and prefix-cache aware routing to maximize GPU utilization.
llm-d
233% 3-year return on investment and 13 months to payback with Red Hat AI
Discover the financial benefits and return on investment (ROI) experienced by customers using Red Hat AI. Learn how organizations turned infrastructure challenges into measurable financial gains with a 3-year ROI of 233% and a 13-month payback period.
Today we announce the General Availability of AI Quickstarts! Get started quickly with your usecase and solve real business problems using Red Hat AI rapidly!
https://docs.redhat.com/en/learn/ai-quickstarts
#RedHat #AI #RedHatAI #OpenShift #OpenShiftAI #RHEL #RHELAI #OpenSource #OpenSourceAI
AI quickstarts | Red Hat Documentation
Red Hat AI | 3 | Red Hat Documentation
Red Hat AI | 3 | Red Hat Documentation

KServe joins CNCF as an incubating project
KServe, the leading standardized AI inference platform on Kubernetes, has been accepted as an incubating project by the Cloud Native Computing Foundation (CNCF).

3 things to know about Red Hat AI 3
YouTube
Red Hat Brings Distributed AI Inference to Production AI Workloads with Red Hat AI 3
Red Hat Brings Distributed AI Inference to Production AI Workloads with Red Hat AI 3

Red Hat OpenShift AI achieves ISO 42001 AI certification
Learn how Red Hat OpenShift AI's ISO 42001 certification reinforces Red Hat's leadership in responsible AI, providing enhanced customer data protection, industry standard alignment, and platform maturity.