New research shows a tuned recommendation engine can boost click‑through rates by 10% while cutting inference cost. The paper dives into model‑serving tricks, optimization for large language models, and deployment efficiency for production AI. Open‑source practitioners will love the practical benchmarks. #RecommendationEngine #InferenceOptimization #ModelServing #ClickThroughRate

🔗 https://aidailypost.com/news/recommendation-engine-lifts-click-through-10-efficiency-needed

Q*Satoshi (@AiXsatoshi)

A tweet sharing a practical hardware and deep-learning dev-environment concern: even after upgrading to a 14 GB/s-class SSD, loading models that run to hundreds of GB still takes long enough on every load to be a performance bottleneck, and the author is considering a storage reconfiguration such as RAID0 to address it.

https://x.com/AiXsatoshi/status/2017849975267594473

#ssd #storage #modelserving #hardware


"I moved to a 14 GB/s SSD, but loading hundreds of GB of model data still takes a while every time… maybe I should go RAID0."

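The bottleneck in the tweet is easy to put numbers on. A back-of-envelope sketch (my assumptions, not from the tweet: a 400 GB model, ideal sequential reads, and a best-case 2x from a two-drive RAID0 stripe; real loads also pay filesystem, PCIe, and deserialization overhead, so these are lower bounds):

```python
def load_time_seconds(model_gb: float, bandwidth_gbps: float) -> float:
    """Ideal sequential-read time, in seconds, for a model of `model_gb`
    gigabytes at `bandwidth_gbps` GB/s of sustained read bandwidth."""
    return model_gb / bandwidth_gbps

# One 14 GB/s NVMe SSD vs. two of them striped (ideal 2x bandwidth).
single = load_time_seconds(400, 14)
raid0 = load_time_seconds(400, 14 * 2)

print(f"single SSD: {single:.1f}s, RAID0 (2x): {raid0:.1f}s")
```

Even in the ideal case, striping only halves a load that is tens of seconds long, which is why people in this situation also look at keeping weights resident in RAM or memory-mapping them.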

Google’s new Ironwood TPU is purpose‑built for inference, delivering ultra‑low latency and high‑volume model serving with a novel inter‑chip interconnect. As the industry pivots to edge AI, this hardware could reshape how we deploy models. Dive into the specs and why it matters for open‑source AI projects. #IronwoodTPU #AIInference #LowLatencyAI #ModelServing

🔗 https://aidailypost.com/news/ironwood-tpu-purposebuilt-hardware-inference-industry-shifts-focus

KServe joins CNCF as an incubating project

KServe, the leading standardized AI inference platform on Kubernetes, has been accepted as an incubating project by the Cloud Native Computing Foundation (CNCF).
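What KServe standardizes is the declarative serving spec itself. A minimal sketch of the shape of its `InferenceService` custom resource, built as a plain Python dict (the model name and `storageUri` below are placeholders I chose for illustration; field names may vary across KServe releases, so check the version you deploy):

```python
import json

# Sketch of a KServe InferenceService manifest: serve a scikit-learn
# model pulled from object storage. Name and storageUri are placeholders.
inference_service = {
    "apiVersion": "serving.kserve.io/v1beta1",
    "kind": "InferenceService",
    "metadata": {"name": "sklearn-iris"},
    "spec": {
        "predictor": {
            "model": {
                "modelFormat": {"name": "sklearn"},
                "storageUri": "gs://example-bucket/models/iris",  # placeholder
            }
        }
    },
}

print(json.dumps(inference_service, indent=2))
```

Applied to a cluster with KServe installed, a resource of this shape is what turns into a running, autoscaled model server.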

🙌 Huge thanks to everyone who contributed to this journey, from writing code and reviewing docs to supporting governance and community growth.

Stay tuned! We’ll be publishing a detailed announcement blog soon with more insights on what this means for users, contributors, and the future of model serving on Kubernetes.

For now: thank you to the community for making this possible. 💙

#KServe #CNCF #OpenSource #ModelServing #AI #MLOps #CloudNative #Kubeflow #Kubernetes #k8s

This is a big step for the KServe community, and we’re excited about the road ahead in making cloud-native model serving more accessible and production-ready for everyone. #KServe #CNCF #OpenSource #ModelServing #AI #MLOps #CloudNative @cncf.io @kubernetes.io @kubefloworg.bsky.social

A huge thank you to Kevin Wang and Faseela K from the CNCF TOC for all the hard work. It’s been such a pleasure collaborating with you both on this milestone. Thank you to all the community members who have contributed!


Big thanks to everyone contributing code, reviews, and ideas — this integration is shaping up to be a game-changer for Kubernetes-native LLM serving. Stay tuned for the next release! #KServe #llmd #GenerativeAI #MLOps #Kubernetes #ModelServing #AIInfrastructure
